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Summary 

In  constructing  the  suite  of  tools  for  the  real-time  delta-sigma  multibeam 
processing,  we  performed  the  following  steps. 

1.  The  algorithm  for  the  design  of  antenna  arrays  (corresponding  to 
user-prescribed  antenna  patterns)  has  been  improved.  While  certain  gen¬ 
eralizations  of  classical  results  (Markov,  Karlin,  et  alii)  state  that  certain 
antenna  structures  exist,  algorithms  for  constructing  such  structures  often 
have  a  heuristic  element.  Our  older  schemes  have  a  non-trivial  failure  rate. 
The  rate  has  been  sufficiently  low  that  practically  any  feasible  structure  can 
be  designed,  and  sufficiently  high  to  require  human  intervention  and  make 
the  design  a  potentially  time-consuming  process. 

The  new  algorithm  has  a  negligible  failure  rate,  and  the  very  definition 
of  failure  is  much  more  benign:  instead  of  failing  to  produce  the  required 
structure,  the  scheme  (very  rarely)  returns  a  structure  containing  one  extra 
element  (compared  to  the  optimal  one). 

2.  When  a  single  antenna  array  has  to  perform  several  tasks,  the  problem 
is  (at  the  present  time)  approached  via  simple  scheduling:  the  tasks  are  or¬ 
dered  in  terms  of  their  urgency,  and  performed  one  after  another.  Obviously, 
some  of  the  tasks  can  be  performed  simultaneously,  either  with  the  existing 
hardware  or  with  very  minor  modifications.  Other  tasks  might  be  fundamen¬ 
tally  incompatible,  and  have  to  be  scheduled  no  matter  what  equipment  is 
to  be  used.  We  have  developed  criteria  (applied  to  far-field  patterns)  that 
permit  us  to  avoid  attempts  to  design  impossible  structures.  There  appears 
to  be  a  chance  that  such  techniques  could  be  used  for  real-time  scheduling. 

3.  We  discovered  that  the  CPU  time  requirements  associated  with  the 
design  of  the  required  patterns  for  real-life  antenna  arrays  (assuming  the 
use  of  classical  algorithms  from  linear  algebra)  exceeds  the  capabilities  of 
the  available  computer  hardware;  Moore’s  law  alone  is  unlikely  to  alter  this 
situation  in  the  near  future.  This  had  been  expected,  and  the  effort  was 
initiated  to  design  and  build  the  requisite  analytical  and  numerical  tools. 

4.  We  discovered  a  new  class  of  algorithms  for  the  “compression”  of  low- 
rank  matrices;  we  refer  to  the  approach  as  “skeletonization”.  It  is  somewhat 
similar  to  the  classical  singular  value  decomposition  (SVD)  but  requires  con¬ 
siderably  less  CPU  time  to  construct,  and  leads  to  schemes  for  the  direct 
(as  opposed  to  iterative)  solution  of  systems  of  linear  algebraic  equations  as- 
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sociated  with  the  numerical  scattering  theory,  design  of  antenna  arrays  and 
patterns,  etc. 

5.  It  has  been  observed  that  the  new  approach  might  lead  to  new  “fast” 
algorithms  for  the  diagonalization  of  (and  for  the  construction  of  the  SVD 
for)  certain  classes  of  matrices,  including  those  encountered  in  the  design  of 
optimal  antenna  patterns  for  conformal  and  volume  antenna  arrays.  Rudi¬ 
mentary  codes  of  this  type  have  been  constructed. 
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Whenever  physical  signals  are  measured  or  generated,  the  locations  of  receivers  or  trans¬ 
ducers  have  to  be  selected.  Most  of  the  time,  this  appears  to  be  done  on  an  ad  hoc  basis. 
For  example,  when  a  string  of  geophones  is  used  in  the  measurements  of  seismic  data  in 
oil  exploration,  the  receivers  are  located  at  equispaced  points  on  an  interval.  When  phased 
array  antennae  are  constructed,  their  shapes  are  determined  by  certain  aperture  consid¬ 
erations;  round  and  rectangular  shapes  are  common.  When  antenna  beams  are  steered 
electronically,  it  is  done  by  changing  the  phases  (and  sometimes,  the  amplitudes)  of  the 
transducers.  Again,  these  transducers  are  located  in  a  region  of  predetermined  geometry, 
and  their  actual  locations  within  that  geometry  are  chosen  via  some  heuristic  procedure. 

In  all  these  (and  many  other)  cases,  the  signals  being  received  or  generated  are  band-limited. 
Optimal  representation  of  such  signals  has  been  studied  in  detail  by  Slepian  et.  al.  more 
than  30  years  ago,  and  some  of  the  obtained  results  were  applied  by  D.  Rhodes  to  the 
design  of  antenna  patterns;  further  development  of  this  line  of  research  appears  to  have 
been  hindered  by  the  absence  at  the  time  of  necessary  numerical  tools.  We  combine  these 
classical  results  with  the  recently  developed  apparatus  of  Generalized  Gaussian  Quadratures 
to  construct  optimal  nodes  for  the  measurement  and  generation  of  band-limited  signals.  In 
this  report,  we  describe  the  procedure  based  on  these  techniques  for  the  design  of  such 
receiver  (and  transducer)  configurations  in  a  variety  of  environments. 


A  Procedure  for  the  Design  of  Apparata  for  the 
Measurement  and  Generation  of  Band-Limited  Signals 
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1  Introduction 


When  measurements  are  performed,  it  often  happens  that  the  signal  to  be  measured  is 
well  approximated  by  linear  combinations  of  oscillatory  exponentials,  i.e.  functions  of 
the  form 


n 


.  ei  Xj  'x 


in  one  dimension,  of  the  form 


(1) 


(2) 

3  =  1 

in  two  dimensions,  and  of  the  form 

f^CXj  .eH^-x+^-y+uyz)  (3) 

j=l 

in  three  dimensions.  In  most  cases,  the  signal  is  band-limited,  i.e.  there  exist  such  real 
positive  a  that  all  1  <  j  <  n, 


Xj  |<  a 


(4) 


in  one  dimension, 

Aj  +  (5) 

in  two  dimensions,  and 

Xj  +  +  v2  <  a2,  (6) 

in  three  dimensions. 

As  is  well-known,  most  measurements  of  electromagnetic  and  acoustic  data  (espe¬ 
cially  at  reasonably  high  frequencies)  are  of  this  form.  Examples  of  such  situations 
include  geophone  and  hydrophone  strings  in  geophysics,  phased  array  antennae  in  radar 
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systems,  multiple  transceivers  in  ultrasound  imaging,  and  a  number  of  other  applications 
in  astrophysics,  medical  imaging,  non-destructive  testing,  etc. 

In  this  report,  we  describe  a  procedure  for  determining  the  optimal  distribution  of 
sources  and  receivers  that  maximizes  accuracy  and  resolution  in  measuring  band-limited 
data  given  a  fixed  number  of  receivers.  Alternatively,  the  procedure  can  be  used  to 
determine  the  optimal  distribution  of  receivers  that  will  minimize  their  number  given 
specified  accuracy  and  resolution.  While  the  techniques  described  in  this  note  are  fairly 
general,  we  describe  them  in  detail  in  the  case  of  linear  antenna  arrays;  the  changes 
needed  to  generalize  the  approach  to  other  cases  are  summarized  in  Section  6. 

Remark  1.1  One  of  principal  issues  in  the  design  of  antenna  arrays  is  the  treatment 
(or  avoidance)  of  the  so-called  supergain  (or  superdirectivity).  Supergain  is  the  con¬ 
dition  that  occurs  when  an  antenna  design  is  attempted  that  is  prohibited  (or  nearly 
prohibited)  by  the  Heisenberg  principle;  technically,  it  occurs  in  the  form  of  very  closely 
spaced  elements  operating  out  of  phaze,  and  leads  to  prohibitive  Ohmic  losses  in  trans¬ 
mitting  antennae,  loss  of  sensitivity  in  receiving  ones,  etc.  Since  the  purpose  of  this 
note  is  to  introduce  techniques  for  selecting  the  locations  of  elements  for  a  prescribed 
antenna  pattern,  we  avoid  the  issue  of  choosing  the  antenna  pattern  altogether.  Instead, 
we  observe  design  optimal  element  distributions  for  several  standard  far-field  patterns 
(see  Section  5.1),  and  we  observe  that  the  scheme  for  choosing  optimal  distributions  of 
elements  is  virtually  independent  of  the  patterns  being  approximated. 

Technically,  the  approach  taken  here  is  to  observe  that  designing  an  antenna  array 
can  be  viewed  as  constructing  a  quadrature  formula  for  the  integration  of  certain  special 
classes  of  functions.  Using  recently  developed  techniques  for  the  construction  of  so-called 
Generalized  Gaussian  Quadratures,  we  obtain  both  nodes  and  weights  that  are  optimal 
(in  a  very  strong  sense)  for  the  required  antenna  pattern. 

The  structure  of  this  note  is  as  follows.  In  Section  2,  we  summarize  some  of  the  math¬ 
ematical  apparatus  to  be  used:  Chebychev  Systems,  Generalized  Gaussian  Quadratures, 
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etc.  In  Section  3,  we  recapitulate  some  of  the  standard  antenna  theory,  primarily  to 
introduce  the  necessary  notation.  In  Section  4,  element  distributions  given  a  specific  an¬ 
tenna  pattern.  In  Section  5,  we  illustrate  our  approach  with  several  numerical  examples, 
and  Section  6  contains  a  discussion  of  the  generality  of  the  schemes  presented. 


2  Analytical  Preliminaries 

In  this  section,  we  summarize  several  known  facts  about  classical  Special  functions.  All 
of  these  facts  can  be  found  in  the  literature;  detailed  references  are  given  in  the  text. 


2.1  Chebyshev  systems 


Definition  2.1  A  sequence  of  functions  fa,...,  fa  will  be  referred  to  as  a  Chebyshev 
system  on  the  interval  [a,  6]  if  each  of  them  is  continuous  and  the  determinant 


fa{x\)  •••  fa{xn) 

4*n  (Xl)  •  1  •  fa(xn) 

is  nonzero  for  any  sequence  of  points  X\, . . . ,  xn  such  that  a  <  X\  <  x^  . . .  <  xn  <  b. 


An  alternate  definition  of  a  Chebyshev  system  is  that  any  linear  combination  of  the 
functions  with  nonzero  coefficients  must  have  no  more  than  n  zeros. 

Examples  of  Chebyshev  and  extended  Chebyshev  systems  include  the  following  (ad¬ 
ditional  examples  can  be  found  in  [8]). 

Example  2.1  The  powers  l,x,x2, . . .  ,xn  form  an  extended  Chebyshev  system  on  the 
interval  (— oo,  oo). 

Example  2.2  The  exponentials  e~XlX,  e~X2X, . . . ,  e~XnX  form  an  extended  Chebyshev  sys¬ 
tem  for  any  Ai, . . . ,  An  >  0  on  the  interval  [0,  oo). 

Example  2.3  The  functions  1 ,  cos  x,  sin  x,  cos  2x,  sin  2x, . . . ,  cos  nx,  sin  nx  form  a  Cheby¬ 
shev  system  on  the  interval  [0,  27t]. 
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Example  2.4  Suppose  that  c>  0  is  a  real  number,  w  is  a  positive  function  [—1, 1]  — >■  R 
such  that  w  €  c2[— 1, 1]  and  w(-x)  =  w(x)  for  all  x  €  [-1,1],  n  is  a  natural  number, 
and  the  operators  P,  Q  :  L2[- 1, 1]  ->  L2[— 1, 1]  are  defined  by  the  formulae 

P(<l>)(x)  =  Jl  w(t)  •  ei  c  x  t  ■  dt  (8) 

Q  =  P*  o  P.  (9) 

Suppose  further  that  fa,<t> 2)...  are  the  eigenfunctions  of  Q,  Ai,A2)...  are  the  corre¬ 
sponding  eigenvalues,  and  Ai  >  A2  >  A3....  Then  all  eigenfunctions  of  Q  (also  known 
as  the  right  singular  vectors  of  P)  can  be  chosen  to  be  real.  Furthermore,  the  functions 
(f>i,  <f>2, . . . ,  (f>n  constitute  a  Chebychev  system  on  the  interval  [—1,1]. 

2.2  Generalized  Gaussian  quadratures 

A  quadrature  rule  is  an  expression  of  the  form 

(fix,),  (10) 

j=l 

where  the  points  Xj  €  R  and  coefficients  Wj  €  R  are  referred  to  as  the  nodes  and  weights 
of  the  quadrature,  respectively.  They  serve  as  approximations  to  integrals  of  the  form 

[  fax)  •  ui(x)dx  (11) 

Ja 

with  u>  is  an  integrable  non-negative  function. 

Quadratures  are  typically  chosen  so  that  the  quadrature  (10)  is  equal  to  the  desired 
integral  (11)  for  some  set  of  functions,  commonly  polynomials  of  some  fixed  order.  Of 
these,  the  classical  Gaussian  quadrature  rules  consist  of  n  nodes  and  integrate  polynomi¬ 
als  of  order  2n  -  1  exactly.  In  [13],  the  notion  of  a  Gaussian  quadrature  was  generalized 
as  follows: 

Definition  2.2  A  quadrature  formula  will  be  referred  to  as  Gaussian  with  respect  to  a 
set  of  2 n  functions  <f> i, . . . ,  <^2n  :  [a,  6]  — >  R  and  a  weight  function  u  :  [a,  6]  — >  R+,  if  it 
consists  of  n  weights  and  nodes,  and  integrates  the  functions  fa  exactly  with  the  weight 
function  u  for  alii  —  1, . . . ,  2n.  The  weights  and  nodes  of  a  Gaussian  quadrature  will  be 
referred  to  as  Gaussian  weights  and  nodes  respectively. 
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The  following  theorem  appears  to  be  due  to  Markov  [15,  16];  proofs  of  it  can  also  be 
found  in  [10]  and  [8]  (in  a  somewhat  different  form). 

Theorem  2.1  Suppose  that  the  functions  0i,...,02n  ■  [a,  b]  — *•  R  form  a  Chebyshev 
system  on  [a,  b ] .  Suppose  in  addition  that  u  :  [a,  b]  — >  R  is  a  non-negative  integrable 
function  [a,  6]  — »•  R.  Then  there  exists  a  unique  Gaussian  quadrature  for  the  functions 
0i, ,  02 n  on  [o,  b]  with  respect  to  the  weight  function  u.  The  weights  of  this  quadrature 
are  positive. 

Remark  2.1  While  the  existence  of  Generalized  Gaussian  Quadratures  was  observed 
more  than  100  years  ago,  the  constructions  found  in  [15,  16],  [3,  10],  [7,  8]  do  not  easily 
yield  numerical  algorithms  for  the  design  of  such  quadrature  formulae;  such  algorithms 
have  been  constructed  recently  (see  [13,  28,  2]).  The  version  of  the  procedure  found  in 
[2]  was  used  to  produce  the  results  presented  in  the  Examples  5.1,  5.2,  5.3  in  Section  5.1; 
the  reader  is  referred  to  [2]  for  details. 

Applying  Theorem  2.1  to  the  Example  2.4,  we  obtain  the  following  theorem. 

Theorem  2.2  Suppose  that  under  the  conditions  of  Example  24,  n  is  even.  Then 
there  exist  n/2  points  ti,t2,  ■  •  •  4n/2  on  the  interval  [—1,1]  and  positive  real  numbers 
Wi,  W2,  •  •  • ,  wnj 2  such  that 

\  n/2 

/  w{t)  •  4>i(t)  dt  =  J2wr  Mtj)>  (12) 

-i 

for  all  i  =  1,2 with  0i,02,...,0n  the  first  n  eigenfunctions  of  the  operator  Q 
defined  in  (9). 

Corollary  2.3  The  above  theorem  provides  a  tool  for  the  efficient  approximate  evalua¬ 
tion  of  integrals  of  the  form  (12),  as  follows.  Given  a  positive  real  e,  we  construct  the 
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Singular  Value  Decomposition  of  the  operator  P  defined  in  (8).  Choosing  n  to  be  the 

smallest  even  integer  such  that 
00 

£  (13) 

j=n+l 

we  construct  an  nj 2-point  quadrature  that  integrates  n  first  right  singular  functions  ex¬ 
actly  (effective  numerical  schemes  for  the  construction  of  such  quadratures  can  be  found 
in  [13,  28,  2]).  Now,  we  observe  that  due  to  the  triangle  inequality  combined  with  the 
positivity  of  the  obtained  weights  w\,  u>2,  •  •  • ,  Wn/2> 


for  any  x  6  [—1, 1]. 

Remark  2.2  The  principal  subject  of  this  note  is  the  fact  that  the  pattern  of  an  antenna 
array  is  formed  by  a  physical  process  amounting  to  a  hardware  implementation  of  a 
quadrature  formula  for  functions  of  the  form  (9).  Thus,  designing  a  configuration  of 
elements  for  such  an  antenna  is  equivalent  to  constructing  a  quadrature  formula  for 
functions  of  the  form(  9),  and  can  be  achieved  via  the  techniques  described  in  [13,  28,  2]). 

3  Elements  of  Antenna  Theory 

In  this  section,  we  summarize  certain  facts  about  the  theory  of  linear  antenna  arrays;  all 
of  these  facts  are  well-known,  and  can  be  found,  for  example,  in  [9]. 

3.1  Pattern  of  a  linear  array 

A  source  distribution  a  on  the  interval  [—1,1]  creates  the  far-field  pattern  /  :  [0, 7r]  — >  C 
given  by  the  formula 

x 

f(9)  =  J  a  (u)  ■  ei  k  u-cos{6)  du,  (15) 

-l 
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where  k  is  the  free-space  wavenumber,  u  is  the  point  on  the  interval  [—1,1],  and  6  is  the 
angle  between  the  point  on  the  horizon  where  the  far  field  is  being  evaluated  and  the 
x-axis.  It  is  customary  to  introduce  the  notation 

x  =  cos(6),  (16) 

and  define  the  function  F  :  [-1, 1]  — >  C  by  the  formula 

F{x)  -  f(acos(x)).  (17) 

Now,  defining  the  operator  A  :  L2[—  1, 1]  — »  L2[—  1, 1]  by  the  formula 

i 

A(a)(x)  =  J  a (u)  ■  el'k'ux  du,  (18) 

-l 

we  observe  that 

i 

F  =  A{a)  =  j  o{u)  •  ei  k  u  x  du.  (19) 

-l 

The  function  F  is  usually  more  convenient  to  work  with  than  /,  and  the  following  obvious 
lemma  is  the  principal  reason  for  this  difference. 

Lemma  3.1  Suppose  that  a  G  L2[—  1, 1],  the  function  F  G  L2[— 1, 1]  is  defined  by  (19), 
a  is  a  real  number,  and  the  function  a  G  L2[— 1,1]  is  defined  by  the  formula 

a{u)  =  el'a  u  •  a(u).  (20) 

Then 

A(a)(x)  =  A(a)(x  —  a)  (21) 

for  all  x  G  (— oo,  oo).  In  other  words,  in  order  to  translate  the  antenna  pattern  F  (viewed 
as  a  function  of  x  =  cos(9)  )  by  a,  one  has  to  multiply  by  el'a  k  the  source  distribution  a 
generating  the  pattern  F. 
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Observation  3.1  While  the  obvious  physical  considerations  lead  to  the  antenna  pattern 
F  defined  on  the  interval  [-1,1],  the  formulae  (15),  (17)  also  define  naturally  the  exten¬ 
sion  of  F  to  the  function  R  -4  C;  in  a  mild  abuse  of  notation,  we  will  be  denoting  by  F 
both  the  original  mapping  [-1,1]  -4  C  and  its  extension  to  the  mapping  Et  ->  C.  Simi¬ 
larly,  we  will  be  denoting  by  A  both  the  operator  L2[-l,l]  -4  L2[-l,l]  defined  by  (18) 
and  its  natural  extension  mapping  L2[- 1, 1]  -4  c°°(R).  The  restriction  of  F  on  R\[-l,  1] 
is  referred  to  as  the  invisible  spectrum  of  the  source  distribution  a  and  plays  an  important 
role  in  the  antenna  theory  (this  role  is  discussed  briefly  in  the  following  subsection).  By 
the  same  token,  the  restriction  of  F  on  the  interval  [-1, 1]  is  referred  to  as  the  visible 
spectrum. 

When  an  antenna  array  is  implemented  in  hardware,  it  is  (usually)  constructed  of 
a  finite  collection  of  elements,  as  opposed  to  being  a  continuous  source  distribution. 
Mathematically,  it  is  equivalent  to  replacing  the  general  function  a  in  (15),  (19)  with  a 
defined  by  the  expression 

o{x)  =  lift'  4>j{u),  (22) 

with  ■  ■  ■  An  the  source  distributions  generated  by  individual  elements,  and  the 

coefficients  ft,  ft,..., ft  the  intensities  of  the  elements.  As  a  rule,  the  elements  are 
localized  in  space  (i.e.  the  functions  <f>x,  fc, . . . ,  (f>n  are  supported  on  small  subintervals 
of  [-1,1]),  and  very  often,  all  of  the  elements  are  identical  (i.e.  the  functions  <t>j  are 
translates  of  each  other),  so  that 

4>j{u)  =  (j){u  —  Uj),  (23) 

with  f>  the  source  distribution  of  a  single  element  located  at  the  point  u  =  0,  and  Uj  the 
location  of  the  element  number  j.  Obviously,  the  far-field  pattern  of  <f>  is  given  by  the 
formula 

i 

F*(x)  =  j  <i>{u)-ei  k  u  x  dm,  (24) 

-l 
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(25) 


combining  (24)  with  (22)  and  (23),  we  obtain  the  identity 

J.  n 

a(x)  =  /  <f>{u)  ■  ei'k'u  x  du  •£/?,••  ei  k'UyX, 

-i 

known  in  the  antenna  theory  as  the  principle  of  pattern  multiplication. 


Remark  3.2  The  standard  form  of  the  principle  of  multiplication  reads:  “The  field 
pattern  of  an  array  of  nonisotropic  but  similar  point  sources  is  the  product  of  the  pattern 
of  the  individual  source  and  the  the  pattern  of  an  array  of  isotropic  point  sources,  having 
the  same  locations,  relative  amplitudes  and  phases  as  the  nonisotropic  point  sources”  (see 
[9]).  Needless  to  say,  this  is  a  special  case  of  the  well-known  theorem  from  the  theory  of 
the  Fourier  Transform,  stating  that  the  Fourier  transform  of  the  product  of  two  functions 
is  the  convolution  of  the  Fourier  Transforms  of  multiplicants. 


4  Antenna  Patterns  and  Corresponding  Optimal  El¬ 
ement  Distributions 

4.1  Characteristics  of  an  antenna  pattern 

Depending  on  the  situation,  the  design  of  an  antenna  array  attempts  to  optimize  certain 
characteristics  of  the  resulting  far-field  pattern,  subject  to  certain  constraints  on  the 
number,  power,  etc.  of  the  elements.  Since  the  principal  purpose  of  this  note  is  to 
describe  a  technique  for  the  selection  of  the  locations  of  the  elements  that  approximate  a 
user-specified  pattern,  we  could  use  any  reasonable  far-field  pattern  to  be  approximated. 
In  subsection  4.2,  4.3,  we  construct  optimal  element  distributions  for  the  so-called  sector 
patterns  and  cosecant  pattern,  respectively;  a  detailed  discussion  of  these  (and  several 
other)  pattern  cans  be  found,  for  example  in  [14]. 

We  will  say  that  the  antenna  pattern  has  the  e-bandwidth  b  if 

i 

J  \F(x)\2  dx  =  e2  ■  J  \F(x)\2  dx  (26) 

6<||x||<l  -1 
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in  other  words,  the  proportion  of  the  energy  radiated  outside  the  e-beamwidth  from  the 
axis  of  the  beam  is  equal  to  e.  The  supergain  of  an  antenna  is  defined  (see,  for  example, 
[27]),  as  the  ratio 

7>(x)|2  dx 

=? - .  (27) 

I  |F(x)|2  dx 

-l 

The  supergain  (sometimes  referred  to  as  superdirectivity)  measures  the  ratio  of  the  en¬ 
ergy  associated  with  the  total  spectrum  of  the  antenna  to  the  energy  in  its  visible  spec¬ 
trum;  while  detailed  discussion  of  supergain  and  related  issues  is  outside  the  scope  of  this 
note,  we  will  observe  that  antenna  arrays  with  large  degrees  of  supergain  would  violate 
the  uncertainty  principle,  and  thus  are  physically  impossible.  Attempts  to  construct 
supergain  antennae  result  in  rapidly  (exponentially)  growing  Ohmic  losses,  prohibitive 
accuracy  requirements,  extremely  low  bandwidth,  etc.  Thus,  any  potentially  useful  pro¬ 
cedure  for  the  design  of  antenna  arrays  has  to  limit  the  supergain  of  the  resulting  patterns. 


4.2  Sector  patterns 

It  is  often  desirable  to  construct  antenna  patterns  that  are  as  constant  as  possible  within 
the  main  beam,  and  as  small  as  possible  outside  it;  in  other  words,  ideally,  the  pattern 
would  be  defined  by  the  formulae 


Fb(:r)  =  1  for  |rr|  <  6, 

(28) 

Fb(x)  =  0  for  |rr|  >  b, 

(29) 

with  b  a  real  number  such  that  0  <  b  <  k.  Needless  to  say,  the  function  Fb  defined  by 
the  formulae  (28),  (29)  is  not  band-limited,  and  some  approximation  has  to  be  used.  A 
standard  procedure  is  to  truncate  the  Fourier  Transform  of  Fb,  approximating  it  by  the 
function  Fb  defined  by  the  formula 


T“1  /  \  / 


sin(b-t)  e, 


(30) 


(see,  for  example,  [26]).  An  important  special  case  occurs  when  b  =  k,  with  (30)  assuming 
the  form 


=  L 


sin(k  •  t)  j.k.x.t 
t  ’ 


(31) 


obviously,  the  latter  expression  is  a  band-limited  approximation  of  the  (5-function.  An¬ 
other  frequently  encountered  situation  is  that  of  b  —  k/2,  so  that  (30)  assumes  the  form 


*<*> = L 


fk 

v2 

t 


$in(2  •  t)  .  i'k'X't 
£  1 


(32) 


which  is  a  band-limited  approximation  to  the  beam  that  is  equal  to  1  for  —1/2  <  x  <  1/2 
and  to  zero  elsewhere. 

In  Section  4.4  below,  we  demonstrate  optimal  element  configurations  that  produce 
approximations  to  the  patterns  (31),  (32)  with  k  =  207T,  107T,  32.46767T. 


Remark  4.1  While  (30)  is  by  no  means  the  only  possible  band-limited  approximations 
to  to  Fb,  it  is  quite  satisfactory  in  most  cases,  in  addition  to  being  simple.  Furthermore, 
the  principal  purpose  of  this  note  is  to  describe  a  technique  for  the  selection  of  locations 
of  the  nodes,  given  a  pattern  to  be  approximated.  Thus,  we  ignore  the  issue  of  the 
optimal  choice  of  Fb. 


4.3  Cosecant  patterns 

Another  standard  far-field  radiation  pattern  is  the  so-called  cosecant  pattern  (see,  for 
example,  [19]).  Given  two  real  numbers  0  <  a  <  b  <  1,  the  cosecant  pattern  Fa>b  is 
defined  by  the  formula 

Fa,b(x)  =  -  (33) 

x 

for  all  x  €  [a,  b],  and 

Fajk  (x)  =  o  (34) 
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for  all  x  G  ([-1, 1]  \  [a,  b ]).  Again,  the  function  Fa>b  defined  by  the  formulae  (33),  (34)  is 
not  band-limited,  and  can  not  be  represented  by  the  expression  of  the  form  (24).  Before 
the  scheme  of  this  note  can  be  applied  to  Fa,b,  the  latter  has  to  be  approximated  with  a 
band-limited  function;  as  discussed  in  Section  4.1  above,  if  such  an  approximation  is  to 
be  useful  as  an  antenna  pattern,  its  supergain  factor  has  to  be  controlled.  Fortunately, 
a  procedure  for  such  an  approximation  has  been  in  existence  for  more  than  35  years 
(see,  [18]);  the  algorithm  of  [18]  is  a  modification  of  the  least-squares  approach  permitting 
the  user  to  limit  the  supergain  factor  of  the  obtained  pattern  explicitly.  At  the  time,  the 
utility  of  the  scheme  of  [18]  was  limited  by  the  (perceived)  difficulty  in  the  numerical 
evaluation  of  Prolate  Spheroidal  Wave  functions;  given  the  present  state  of  numerical 
analysis,  this  difficulty  is  non-existent,  and  it  is  this  author’s  impression  that  the  insights 
of  [18],  [19]  deserve  more  attention  than  they  have  been  receiving. 

4.4  Optimal  distributions  of  elements 

In  this  subsection,  we  briefly  describe  an  algorithm  for  the  construction  of  optimal  (in 
the  sense  defined  below)  element  configurations  for  the  generation  of  antenna  patterns 
given  by  (15),  of  which  the  patterns  (29)-(31)  are  special  cases.  As  will  be  seen,  the 
procedure  is  in  fact  applicable  to  the  design  of  element  configurations  for  very  general 
far-field  patterns. 

We  start  with  observing  that  (15)  expresses  the  far-field  pattern  F  as  an  integral  over 
the  interval  [—1,1]  of  functions  of  the  form 

<r(u)  •  ei  fc  x  u,  (35) 

with  x  =  cos{6 )  determined  by  the  direction  6  in  which  the  far-field  is  being  evaluated.  In 
other  words,  the  problem  of  finding  efficient  antenna  element  distributions  is  equivalent 
to  that  of  constructing  quadrature  formulae  for  integrals  of  the  form  (8),  with 

w(t)  =  a(t).  (36) 
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In  the  cases  when  o  is  non-negative  everywhere  on  the  interval  [—1,1],  Theorem  2.2 
guarantees  the  existence  of  Generalized  Gaussian  Quadratures,  and  [13,  28])  provide  a 
satisfactory  numerical  apparatus  for  the  construction  of  such  quadratures.  Obviously,  the 
patterns  given  by  the  formula  (28)  are  not  generated  by  non-negative  source  distributions, 
except  when 

b  <  7r.  (37) 

Thus,  for  these  (and  many  other)  patterns,  the  conditions  of  Theorem  2.2  are  violated, 
and  the  existence  of  Generalized  Gaussian  Quadratures  is  not  guaranteed.  In  our  numer¬ 
ical  experiments,  the  techniques  of  [2])  (after  some  tuning)  have  always  been  successful 
in  finding  the  Gaussian  quadratures  for  integrals  of  the  form  (28);  some  of  our  results 
are  presented  in  Section  5  below. 

5  Numerical  Examples 

In  this  section,  we  present  examples  of  optimal  element  distributions  generating  the 
patterns  of  the  preceding  Section;  all  of  the  results  presented  here  have  been  obtained 
numerically.  Antenna  patterns  we  present  are  compared  to  the  antenna  patterns  given 
by  uniform  source  distributions;  configurations  of  elements  approximating  these  antenna 
patterns  are  compared  to  equispaced  distributions  of  elements  generating  the  same  an¬ 
tenna  patterns. 

5.1  Optimal  distributions  of  elements 

In  this  section,  we  demonstrate  the  results  of  the  application  of  the  techniques  of  Sec¬ 
tion  4.4  of  this  note  to  the  types  of  antenna  patterns  described  in  the  Sections  4.2,  4.3. 

In  all  cases,  we  choose  the  size  of  an  antenna  array  and  a  pattern  to  be  reproduced,  and 
use  the  scheme  outlined  in  Section  4.4  to  design  a  distribution  of  antenna  elements  (both 
the  locations  and  the  intensities)  located  within  the  chosen  array  that  reproduces  the 
required  pattern.  For  comparison,  we  also  generate  optimal  (in  the  least  squares  sense) 
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approximations  to  the  desired  pattern  generated  by  equispaced  elements  located  within 
the  same  array.  Since  the  number  of  equispaced  nodes  required  to  obtain  a  reasonable 
approximation  to  the  desired  pattern  is  (in  many  cases)  much  greater  than  the  number  of 
optimally  chosen  nodes,  for  each  example  we  demonstrate  patterns  generated  by  several 
such  configurations.  In  this  manner,  the  numbers  of  optimally  chosen  nodes  necessary 
to  obtain  reasonable  approximations  to  the  desired  patterns  can  be  compared  to  the 
numbers  of  equispaced  nodes  required  to  obtain  similar  results. 

5.1.1  Sector  patterns 

Example  5.1  The  first  example  we  consider  is  of  the  pattern  defined  by  the  formula  (32), 
with  k  =  62.8312,  so  that  the  size  of  the  array  is  20  wavelengths. 

In  Figure  5,  we  display  an  approximation  to  the  pattern  obtained  with  19  elements, 
overlayed  with  the  exact  pattern;  the  locations  of  the  elements  are  displayed  in  Figure  5a; 
the  relative  error  of  the  obtained  approximation  is  5.01%. 

Similarly,  in  Figure  5g,  we  display  the  approximation  to  the  pattern  obtained  with  21 
elements,  overlayed  with  the  exact  pattern;  the  relative  error  of  the  obtained  approxima¬ 
tion  is  0.443%;  in  Figure  5h,  we  display  the  the  approximation  obtained  with  17  elements. 
In  the  latter  case,  the  relative  error  of  the  obtained  approximation  is  6.43%;  Figure  5i 
depicts  the  17 -node  distribution  producing  the  approximation  illustrated  in  Figure  5h. 
Finally,  Figure  5j  contains  a  graph  of  the  values  of  the  sources  located  at  the  17  nodes 
depicted  in  Figure  5i  and  generating  the  pattern  shown  in  Figure  5h. 

For  comparison,  the  optimal  approximation  obtained  with  19,  24,  29,  31,  and  34 
equispaced  elements  are  displayed  in  Figures  5b,  5c,  5d,  5e,  5f,  respectively;  these  are 
also  overlayed  with  the  exact  pattern. 

Example  5.2  Our  second  example  is  identical  to  the  first  one,  with  the  exception  that 
k  =  31.416,  so  that  the  size  of  the  array  is  10  wavelengths. 

In  Figure  6,  we  display  an  approximation  to  the  pattern  obtained  with  9  elements, 
overlayed  with  the  exact  pattern;  the  locations  of  the  elements  are  displayed  in  Figure  6a; 
the  relative  error  of  the  obtained  approximation  is  11.2%. 
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Similarly,  in  Figure  6f,  we  display  the  approximation  to  the  pattern  obtained  with  11 
elements,  overlayed  with  the  exact  pattern;  the  relative  error  of  the  obtained  approxima¬ 
tion  is  0.600%. 

For  comparison,  the  optimal  approximation  obtained  with  9,  14,  16,  and  18  equispaced 
elements  are  displayed  in  Figures  6b,  6c,  6d,  5e,  respectively;  these  are  also  overlayed 
with  the  exact  pattern. 

Example  5.3  Our  third  example  is  identical  to  the  preceding  two,  with  the  exception 
that  k  =  102,  so  that  the  size  of  the  array  is  about  32.45  wavelengths. 

In  Figure  la,  we  display  an  approximation  to  the  pattern  obtained  with  23  optimally 
distributed  elements,  overlayed  with  the  exact  pattern  and  with  the  pattern  obtained  with 
23  equispaced  elements. 

The  relative  error  of  the  obtained  approximation  is  5.4%;  needless  to  say,  the  error  of 
the  approximation  obtained  with  the  equispaced  nodes  is  more  than  70%.  As  can  be  seen 
from  Figure  1c,  the  actual  size  of  the  obtained  23 -element  array  is  about  21  wavelengths; 
in  other  words,  in  order  to  obtain  this  precision,  the  array  needs  to  be  about  2/3  of  the 
nominal  (maximum  permitted)  length. 

In  Figure  7b,  we  display  the  approximation  to  the  pattern  obtained  with  42  and  48 
elements,  overlayed  with  the  exact  pattern. 

It  is  worth  noting  that  with  33  optimally  distributed  elements,  the  pattern  is  approxi¬ 
mated  to  the  precision  0.12%;  we  do  not  display  the  obtained  pattern  since  it  is  visually 
indistinguishable  from  the  pattern  being  approximated. 

Example  5.4  Our  final  example  is  somewhat  different  from  the  preceding  ones,  in  that 
instead  of  approximating  a  sector  pattern,  we  approximate  a  cosecant  pattern  (see  (33),  (34) 
in  Subsection  4-3  above). 

In  this  example,  we  set 

a  =  sin( 15°),  (38) 
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b  =  sin(  75°), 


(39) 


and  use  the  procedure  of  [18]  to  approximate  Fa>b  with  a  band-limited  function.  The  band- 
limit  has  been  more  or  less  arbitrarily  set  to  110,  resulting  in  an  antenna  array  about  35 
wavelengths  in  size,  and  the  supergain  factor  of  the  approximation  was  set  to  1.1. 

In  Figure  8a,  we  display  an  approximation  to  the  pattern  obtained  with  53  optimally 
distributed  elements,  overlayed  with  the  exact  bandlimited  pattern  and  with  the  pattern 
obtained  with  53  equispaced  elements. 

The  relative  error  of  the  obtained  approximation  is  1.79%;  the  error  of  the  approxi¬ 
mation  obtained  with  the  equispaced  nodes  is  about  42%. 

In  Figure  8b,  we  display  the  approximation  to  the  pattern  obtained  with  47  optimally 
distributed  elements,  overlayed  with  the  exact  pattern;  the  purpose  of  this  final  figure  is 
to  demonstrate  the  behavior  of  the  scheme  when  the  number  of  elements  is  insufficient 
(i.e.  when  the  array  is  underresolved). 

It  is  worth  noting  that  it  takes  about  70  equispaced  nodes  to  obtain  the  resolution 
obtained  with  47  optimally  chosen  ones. 

The  following  observations  can  be  made  from  Figures  5  -  8b,  and  from  the  more 
detailed  numerical  experiments  performed  by  the  author. 

1.  In  order  to  obtain  reasonable  precision,  the  scheme  requires  about  1  point  per  wave¬ 
length  in  the  antenna  array;  this  is  more  or  less  independent  from  the  structure  of  the 
beam  as  long  as  the  pattern  is  symmetric  about  the  point  x  =  0.  This  fact  is  observed 
numerically,  even  for  modest  numbers  of  nodes;  for  large-scale  arrays,  this  statement 
(interpreted  asymptotically)  can  be  proved  rigorously.  For  certain  beam  structures,  the 
required  number  of  nodes  is  even  less  (see  Example  5.3).  The  reasons  for  these  additional 
savings  are  subtle,  and  have  to  do  with  the  fact  that  the  continuous  source  distribution 
generating  the  pattern  is  relatively  small  on  a  large  part  of  the  antenna  array;  the  al¬ 
gorithm  of  [2]  takes  advantage  of  this  fact  to  reduce  the  number  of  nodes.  When  the 
beam  is  not  symmetric  about  x  =  0,  the  number  of  elements  required  does  depend  on 
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the  structure  of  the  pattern,  and  the  dependence  is  fairly  complicated.  Generally,  the 
improvement  for  non-symmetric  beams  is  less  than  that  for  the  symmetric  ones. 

2.  The  qualiative  behavior  of  the  scheme  is  similar  to  that  of  the  Gaussian  quadratures 
in  that  it  displays  no  convergence  at  all  until  a  certain  minimum  number  of  nodes  is 
achieved;  after  that,  the  convergence  is  very  fast.  This  behavior  is  not  surprising,  since 
the  scheme  is  based  on  a  Generalized  Gaussian  quadrature. 

3.  For  the  sector  pattern  with  the  sector  [—1/2, 1/2],  the  scheme  reduces  the  required 
number  of  nodes  by  a  factor  of  about  1.5  for  small-scale  problems,  and  roughly  by  a 
factor  of  2  for  large-scale  ones;  again,  for  large-scale  problems,  an  asymptotic  version  of 
this  statement  can  be  proven  rigorously. 

4.  For  the  cosecant  pattern  with  the  parameters  specified  by  (38),  (39),  the  number 
of  nodes  required  is  reduced  by  approximately  a  factor  of  1.4.  As  the  sidelobe  level  is 
reduced,  the  improvement  obtained  by  going  from  the  equispaced  discretization  to  the 
optimal  one  increases  rapidly. 

5.  An  examination  of  Figures  5a,  6a  shows  that  while  the  optimal  nodes  are  by  no  means 
uniform,  they  display  no  clustering  behavior. 

6.  An  examination  of  Figure  5j  shows  that  the  intensities  of  individual  elements  do  not 
become  large;  this  is  confirmed  by  the  more  extensive  numerical  experiments  performed 
by  the  author. 

7.  The  combination  of  the  preceding  two  paragraphs  (combined  with  additional  numer¬ 
ical  experiments  and  analysis)  provide  evidence  that  configurations  of  this  type  should 
pose  no  supergain  problems. 

6  Generalizations 

The  results  described  above  admit  radical  generalizations  in  several  directions;  several 
such  directions  are  discussed  below, 
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1.  Conformal  one-dimensional  arrays.  The  extension  of  the  techniques  of  this  note 
to  one-dimensional  arrays  located  on  curves  in  R 3  is  completely  straightforward,  involving 
only  a  modest  increase  of  the  CPU  time  requirements  of  the  procedure.  Improvement  in 
the  number  of  nodes  required  to  produce  a  prescribed  pattern  is  similar  to  that  in  the 
case  of  a  linear  array. 

2.  Planar  two-dimensional  arrays.  A  straightforward  generalization  of  the  results  of 
Sections  4,  5,  is  to  rectangular  planar  arrays.  Here,  a  tensor  product  quadrature  can  be 
constructed  from  the  quadratures  of  Sections  4,  5,  possessing  all  of  the  desirable  prop¬ 
erties  of  the  latter.  Obviously,  the  advantage  in  the  number  of  transducers  is  squared, 
so  that  (for  example)  replacing  50  nodes  in  each  of  the  two  directions  by  23  nodes  (see 
Example  5.3  above)  will  lead  to  a  factor  of  (50/23)2  ~  4.7  savings  in  the  number  of 
elements. 

The  theory  of  Section  4  has  been  extended  for  disk-shaped  arrays,  via  ( inter  alia )  the 
techniques  developed  in  [23].  The  improvement  in  the  number  of  nodes  is  comparable  to 
that  obtained  in  the  rectangular  geometry,  and  the  CPU  time  requirements  do  not  differ 
appreciably  from  those  in  the  case  of  linear  one-dimensional  arrays. 

The  extension  of  the  theory  to  more  general  geometries  in  the  plane  is  in  progress.  At 
the  present  time,  our  only  numerical  experiments  have  been  with  arrays  on  triangles;  the 
results  are  encouraging,  but  the  CPU  time  requirements  of  the  algorithms  are  excessive 
(we  have  only  been  able  to  design  triangular  arrays  about  6  wavelengths  in  size).  We 
are  now  in  the  process  of  constructing  a  more  efficient  numerical  procedure  for  such 
computations. 

3.  Conformal  two-dimensional  arrays.  The  only  environment  in  which  we  have 
a  satisfactory  theory  is  when  the  array  is  located  on  a  surface  of  revolution;  even  in 
this  environment,  no  experiments  have  been  performed.  We  have  not  investigated  more 
general  conformal  two-dimensional  arrays  in  sufficient  detail. 


18 


1 


Figure  5b:  The  optimal  approximation  to  the  sector  pattern  generated  by  19 
equispaced  nodes,  as  described  in  Example  5.1 


Figure  5c:  The  optimal  approximation  to  the  sector  pattern  generated  by  24 
equispaced  nodes,  as  described  in  Example  5.1 
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Figure  5f:  The  optimal  approximation  to  the  sector  pattern  generated  by  34 
equispaced  nodes,  as  described  in  Example  5.1 


Figure  5g:  The  optimal  approximation  to  the  sector  pattern  generated  by  21 
optimal  nodes,  as  described  in  Example  5.1 
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Figure  5j:  The  values  of  the  sources  located  at  the  nodes  depicted  in  Figure  5i 
and  generating  the  pattern  depicted  in  Figure  5h,  as  described  in  Example  5.1 
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Figure  6b:  The  optimal  approximation  to  the  sector  pattern  generated  by  9 
equispaced  nodes,  as  described  in  Example  5.2 


Figure  6c:  The  optimal  approximation  to  the  sector  pattern  generated  by  14 
equispaced  nodes,  as  described  in  Example  5.2 
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Figure  6d:  The  optimal  approximation  to  the  sector  pattern  generated  by  16 
equispaced  nodes,  as  described  in  Example  5.2 


Figure  6e:  The  optimal  approximation  to  the  sector  pattern  generated  by  18 
equispaced  nodes,  as  described  in  Example  5.2 
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Figure  6f:  The  pattern  created  by  the  11  optimal  elements,  in  Example  5.2 
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Figure  7a:  The  approximation  to  the  sector  pattern  generated  by  23  optimal 
elements,  vs.  optimal  approximation  by  23  equispaced  nodes,  as  described  in 

Example  5.3 


Figure  7b:  The  optimal  approximations  to  the  sector  pattern  generated  by  42 
and  48  equispaced  nodes,  as  described  in  Example  5.3 
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Figure  8a:  The  approximation  to  the  cosecant  pattern  generated  by  53 
optimal  elements,  vs.  optimal  approximation  by  53  equispaced  nodes,  as 
described  in  Example  5.4 


Figure  8a:  The  approximation  to  the  cosecant  pattern  generated  by  47 
optimal  elements,  as  described  in  Example  5.4 
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A  procedure  is  reported  for  the  compression  of  rank-deficient  matrices.  A  matrix  A  of  rank 
k  is  represented  in  the  form  A  =  U  o  B  o  V,  where  B  is  a  k  x  k  submatrix  of  A,  and  U ,  V 
are  well-conditioned  matrices  that  each  contain  a  k  x  k  identity  submatrix.  This  property 
enables  such  compression  schemes  to  be  used  in  certain  situations  where  the  SVD  cannot 
be  used  efficiently.  Numerical  examples  are  presented. 
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On  the  compression  of  low  rank  matrices 

H.  Cheng,  Z.  Gimbutas,  P.G.  Martinsson,  V.  Rokhlin 


Abstract:  A  procedure  is  reported  for  the  compression  of  rank-deficient 
matrices.  A  matrix  A  of  rank  k  is  represented  in  the  form  A  =  UoBoV, 
where  B  is  a  k  x  k  submatrix  of  A,  and  U,  V  are  well-conditioned 
matrices  that  each  contain  a  k  x  k  identity  submatrix.  This  property 
enables  such  compression  schemes  to  be  used  in  certain  situations  where 
the  SVD  cannot  be  used  efficiently.  Numerical  examples  are  presented. 

1.  Introduction 

In  computational  physics  (and  many  other  areas),  one  often  encounters  matrices  whose  ranks 
are  (to  high  precision)  much  lower  than  their  dimensionalities;  even  more  frequently,  one  is  con¬ 
fronted  with  matrices  posessing  large  submatrices  that  are  of  low  rank.  An  obvious  source  of  such 
matrices  is  the  potential  theory,  where  discretization  of  integral  equations  almost  always  results  in 
matrices  of  this  type.  Such  matrices  are  also  encountered  in  fluid  dynamics,  numerical  simulation 
of  electromagnetic  phenomena,  structural  mechanics,  multivariate  statistics  etc.  In  such  cases,  one 
is  tempted  to  “compress”  the  matrices  in  question,  so  that  they  could  be  efficiently  applied  to  arbi¬ 
trary  vectors;  compression  also  facilitates  the  storage  and  any  other  manipulation  of  such  matrices 
that  might  be  desirable. 

At  this  time,  several  classes  of  algorithms  exist  that  use  this  observation.  The  so-called  Fast 
Multipole  Methods  (FMMs)  are  algorithms  for  the  application  of  certain  classes  of  matrices  to 
arbitrary  vectors;  FMMs  tend  to  be  extremely  efficient,  but  are  only  applicable  to  very  narrow 
classes  of  operators  (see  [7]).  Another  approach  to  the  compression  of  operators  is  based  on  the 
wavelets  and  related  structures  (see,  for  example,  [3,  2]);  these  schemes  exploit  the  smoothness 
of  the  elements  of  the  matrix  viewed  as  a  function  of  their  indices,  and  tend  to  fail  for  highly 
oscillatory  operators. 

Finally,  there  is  a  class  of  compression  schemes  that  are  based  purely  on  linear  algebra,  and  are 
completely  insensitive  the  the  analytical  origin  of  the  operator.  It  consists  of  the  Singular  Value 
Decomposition  (SVD),  the  so-called  QR  and  QLP  factorizations  [8],  and  several  others.  Given  an 
m  x  n- matrix  A  of  rank  k  <  min (m,n),  the  SVD  represents  A  in  the  form 

(1.1)  A  =  U°DoV, 

with  U  an  m  x  k,  matrix  whose  columns  are  orthonormal,  V  &  k  x  n  matrix  whose  rows  are 
orthonormal,  and  D  a  diagonal  matrix  whose  diagonal  elements  are  positive.  The  compression 
provided  by  the  SVD  is  perfect  in  terms  of  accuracy  (see,  for  example,  [5]),  and  has  a  simple 
geometric  interpretation:  it  expresses  each  of  the  columns  of  A  as  a  linear  combination  of  the  k 
(orthonormal)  columns  of  U]  it  also  represents  the  rows  of  A  as  linear  combinations  of  (orthonormal) 
rows  of  V ;  and  the  matrices  U,  V  are  chosen  in  such  a  manner  that  the  rows  of  U  are  images  (up 
to  a  scaling)  under  A  of  the  columns  of  V. 

In  this  paper,  we  propose  a  different  matrix  decomposition.  Specifically,  we  represent  the  matrix 
A  described  above  in  the  form 

(1.2)  A  =  UoBoV, 

l 


where  B  is  a  k  x  fc-submatrix  of  A ,  and  the  norms  of  the  matrices  U,  V  (of  dimensionalities  n  x  /c, 
k  x  m  respectively)  are  reasonably  close  to  1  (see  Theorem  3  in  Section  3  below).  Furthermore, 
each  of  the  matrices  U,  V  contains  a  unity  k  x  k  submatrix. 

Like  (1.1),  the  representation  (1.2)  has  a  simple  geometric  interpretation:  it  expresses  each  of 
the  columns  of  A  as  a  linear  combination  of  k  selected  columns  of  A ,  and  each  of  the  rows  of  A  as 
a  linear  combination  of  k  selected  rows  of  A.  This  selection  defines  akxk  submatrix  B  of  A,  and 
in  the  resulting  system  of  coordinates,  the  action  of  A  is  represented  by  the  action  of  its  submatrix 
B. 

The  representation  (1.2)  has  the  advantage  that  the  bases  used  for  the  representation  of  the 
mapping  A  consists  of  the  columns  and  rows  of  A,  while  each  of  the  elements  of  the  bases  in  the 
representation  (1.1)  is  itself  a  linear  combination  of  all  rows  (or  columns)  of  the  matrix  A.  In 
Section  5,  we  illustrate  the  advantages  of  the  representation  (1.2)  by  constructing  an  accelerated 
direct  solver  for  integral  equations  of  potential  theory. 

Another  advantage  of  the  representation  (1.2)  is  that  the  numerical  procedure  for  constructing 
it  is  considerably  less  expensive  than  that  for  the  construction  of  the  SVD  (see  Section  4),  and  that 
the  cost  of  applying  (1.2)  to  an  arbitrary  vector  is 

(1.3)  (n  +  m-k)-k, 

vs. 

(1.4)  (n  +  m)‘k 
for  the  SVD. 

The  obvious  disadvantage  of  (1.2)  vis-a-vis  (1.1)  is  the  fact  that  the  norms  of  the  the  matrices 
U,V  are  somewhat  greater  than  1,  leading  to  some  (though  minor)  loss  of  accuracy.  Another 
disadvantage  of  the  proposed  factorization  is  its  non-uniqueness;  in  this  respect  it  is  similar  to  the 
pivoted  QR  factorization. 

Remark  1.  In  (1.2),  the  submatrix  B  of  the  matrix  A  is  defined  as  the  intersection  of  k  columns 
with  k  rows.  Denoting  the  sequence  numbers  of  the  rows  by  *i,  ■ •  • ,  **:  and  the  sequence  numbers 

of  the  columns  by  31,32,  •  .  • ,  jk,  we  will  be  referring  to  the  submatrix  B  of  A  as  the  skeleton  of  A, 
to  the  kxn  matrix  consisting  of  the  rows  of  A  numbered  »i,«2>  •■•Ak  as  the  row  skeleton  of  A,  and 
to  the  mxk  matrix  consisting  of  the  columns  of  A  numbered  31,32,  •••■,3  k  as  the  column  skeleton 
of  A. 

The  structure  of  this  paper  is  as  follows.  Section  2  below  summarizes  several  facts  from  numerical 
linear  algebra  to  be  used  in  the  remainder  of  the  paper.  In  Section  3,  we  prove  the  existence  of 
a  stable  factorization  of  the  form  (1.2).  In  Section  4,  we  describe  a  reasonably  efficient  numerical 
algorithm  for  constructing  such  a  factorization.  In  Section  5,  we  illustrate  how  the  geometric 
properties  of  the  factorization  (1.2)  can  be  utilized  to  construct  an  accelerated  direct  solver  for 
integral  equations  of  potential  theory.  In  Section  6,  we  present  the  results  of  numerical  experiments 
with  the  direct  solver.  Finally,  Section  7  contains  a  discussion  of  other  possible  applications  of  the 
techniques  of  this  paper. 


2.  Preliminaries 

In  this  section  we  introduce  our  notation  and  summarize  several  facts  from  numerical  linear 
algebra;  these  can  all  be  found  in  [1]. 
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4  Throughout  the  paper,  we  use  upper  case  letters  for  matrices  and  lower  case  letters  for  vectors 

and  scalars.  We  reserve  Q  for  matrices  that  have  orthonormal  columns  and  P  for  permutation 
matrices.  The  canonical  unit  vectors  in  Cn  are  denoted  by  ej.  Given  a  matrix  X ,  we  let  X*  denote 
•  its  adjoint  (the  complex  conjugate  transpose),  Ok{X)  its  &-th  singular  value,  ||X||2  its  l2-norm  and 

||X||f  its  Frobenius  norm.  Finally,  given  matrices  A,  B,  C  and  D  we  let 


(2.1) 


and 


'  A 

B  ' 

C 

D 

denote  larger  matrices  obtained  by  stringing  the  blocks  A,  B,  C  and  D  together. 

The  first  result  that  we  present  asserts  that  given  any  matrix  A,  it  is  possible  to  reorder  its 
columns  to  form  a  matrix  AP ,  where  P  is  a  permutation  matrix,  with  the  following  property: 
When  AP  is  factorized  into  an  orthonormal  matrix  Q  and  an  upper  triangular  matrix  R,  so  that 
AP  =  QR,  then  the  singular  values  of  the  leading  k  x  k  submatrix  of  R  are  reasonably  good 
approximations  of  the  first  k  singular  values  of  A.  The  theorem  also  says  that  the  first  k  columns 
of  AP  form  a  well-conditioned  basis  for  the  column  space  of  A  to  within  accuracy  Ofc+i(A). 


Theorem  1.  [Gu  &  Eisenstat]  Suppose  that  A  is  anmxn  matrix,  l  =  min(m, n),  and  k  is  an 
integer  such  that  1  <  k  <  l.  Then  there  exists  a  factorization 

(2.2)  AP  =  QR, 


where  P  is  annxn  permutation  matrix,  Q  is  an  m  x  l  matrix  with  orthonormal  columns,  and  R 
is  an  l  x  n  upper  triangular  matrix.  Furthermore,  splitting  Q  and  R, 


(2.3) 


Q  = 


Q11 

Q12 

.  Q21 

Q22  . 

and  R  = 


fin 

R\2 

0 

R22  . 

in  such  a  fashion  that  Qn  and  Rn  are  of  size  kxk,  Q21  is  ( m  —  k)x  k,  Qn  is  kx  (l  —  k),  Q22  is 
(m  —  k)x  (l  —  k),  R\2  is  kx  (n  —  k)  and  R22  is  (l  —  k)x(n  —  k),  results  in  the  following  inequalities: 


(2.4) 


M-Rn)  >  <?k(A) 


1 

y/l  +  fc(n  —  k )’ 


(2.5)  <71(^22)  <  <Tfc+i(i4)x/l  +  &(n  —  k), 
and 

(2.6)  IK/^IIf  <  \A(n  -  k). 

Remark  2.  In  this  paper  we  do  not  use  the  full  power  of  Theorem  1  since  we  are  only  concerned 
with  the  case  of  very  small  e  =  <7fc+i(A).  In  this  case,  the  inequality  (2.5)  implies  that  A  can  be 
well  approximated  by  a  low-rank  matrix.  In  particular,  (2.5)  implies  that 


(2.7) 


fan  |  Rn]  P*\\2  <  ey/l  +  *(«-*)• 


Furthermore,  the  inequality  (2.6)  in  this  case  implies  that  the  first  k  columns  of  AP  form  a  well- 
conditioned  basis  for  the  entire  column  space  of  A  (within  accuracy  e). 


While  Theorem  1  asserts  the  existence  of  a  factorization  (2.2)  with  the  properties  (2.4),  (2.5), 
*  (2-6),  it  says  nothing  about  the  cost  of  constructing  such  a  factorization  numerically.  The  following 

theorem  asserts  that  a  factorization  that  satisfies  bounds  that  are  weaker  than  (2.4),  (2.5),  (2.6) 
by  a  factor  of  yfn  can  be  computed  in  0(mn2)  operations. 
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Theorem  2.  [Gu  &  Eisenstat]  Given  an  m  x  n  matrix  A,  a  factorization  of  the  form  (2.2)  that 
instead  of  (2-4),  (2.5)  and  (2.6)  satisfies  the  inequalities 


(2.8) 


ak{R u )  > 


\/l  +  nk(n  —  k) 


°k{A), 


(2.9)  o-|(i?22)  <  \A  +  nk(n  -  k)crk+i{A), 
and 

(2.10)  ||i?n -R^IIf  <  Vnk(n-k), 
can  be  computed  in  0(mn 2)  operations. 


3.  Analytical  apparatus 


In  this  section  we  prove  that  the  factorization  (1.2)  exists  by  applying  Theorem  1  to  both  the 
columns  and  the  rows  of  the  matrix  A.  Theorem  2  then  guarantees  that  the  factorization  can  be 
computed  efficiently. 

The  following  theorem  is  the  principal  analytic  tool  of  this  paper. 


Theorem  3.  Suppose  that  A  is  anmxn  matrix  and  let  k  be  such  that  1  <  k  <  min(m,n).  Then 
there  exists  a  factorization 


(3.1) 


A  =  Pl 


As  [/|  T]  Pr  +  X, 


where  I  €  Ckxk  is  the  identity  matrix,  PL  and  Pr  are  permutation  matrices,  and  As  is  the  top  left 
kxk  submatrix  of  P£ A  Pr.  In  (3.1),  the  matrices  S  6  C{m~k)xk  and  T  e  Ckx(n'k)  satisfy  the 
inequalities 

(3.2)  ||S||f  <  \/k(m  -  k),  and  ||T||f  <  \/k(n  —  A:), 

and  the  matrix  X  is  small  if  the  (k  +  1  )-th  singular  value  of  A  is  small, 


(3.3) 


IIAII2  <  crie+i (A) y/l  +  A:(min(m, n)  -  A:) . 


Proof:  The  proof  consists  of  two  steps.  First  Theorem  1  is  invoked  to  assert  the  existence  of  k 
columns  of  A  that  form  a  well-conditioned  basis  for  the  column  space  within  accuracy  ok+i{A)', 
these  are  collected  in  the  m  x  k  matrix  Acs-  Then  Theorem  1  is  invoked  again  to  prove  that  k 
of  the  rows  of  Acs  form  a  well-conditioned  basis  for  its  row  space.  Without  loss  of  generality,  we 
assume  that  m  >  n  and  that  cr*(A)  7^  0. 

For  the  first  step  we  factor  A  into  matrices  Q  and  R  as  specified  by  Theorem  1,  letting  Pr  denote 
the  permutation  matrix.  Splitting  Q  and  R  into  submatrices  Qij  and  Rij  as  in  (2.3),  we  reorganize 
the  factorization  (2.2)  as  follows, 


(3.4)  AFfe.  = 


On 

Q21 


[Pn|  P12]  + 


Q12 

Q22 


[0|  R22]  = 


Qnflii 

Q21R11 


[I\  RjRu} 


We  now  define  the  matrix  T  €  C/cx(n_fc)  via  the  formula 


(3.5) 


T  —  Pn  P12! 
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0  Q12R22 
0  Q22R22 


T  satisfies  the  inequality  (3.2)  by  virtue  of  (2.6).  We  define  the  matrix  X  €  <Cmxn  via  the  formula 


(3.6) 


X  = 


'  0 

Ql2#22 

0 

Ql\R22  . 

ph 


which  satisfies  the  inequality  (3.3)  by  virtue  of  (2.5).  Defining  the  matrix  Acs  €  Cmx/c  by 

Qii-Rn 


(3.7) 


Acs  = 


Q2\R\\ 

we  reduce  equation  (3.4)  to  the  form 

(3.8)  APr  =  Acs  [7|  P]  +  XPr. 

An  obvious  interpretation  of  (3.8)  is  that  Acs  consists  of  the  first  k  columns  of  the  matrix  APr 
(since  the  corresponding  columns  of  XPr  are  identically  zero). 

The  second  step  of  the  proof  is  to  find  k  rows  of  Acs  forming  a  well-conditioned  basis  for  its 
row-space.  To  this  end,  we  factor  the  transpose  of  Acs  as  specified  by  Theorem  1, 

(3.9)  Aics-Fl  =  Q  [-Rii|  -R12]  • 

Transposing  (3.9)  and  rearranging  the  terms  we  have 

R*n 


(3.10) 


PlAcs 


Rh 


Q1 


1 

~  .  Ri2(R*n)~l 


R*uQ*. 


Multiplying  (3.8)  by  P£  and  using  (3.10)  to  substitute  for  i^Acs  we  obtain 

I 


(3.11) 


Pi  APr  = 


L  RU&u) 


**  \  — 1 


R*iQ*  [I\T]  +PIXPR. 


We  now  convert  (3.11)  into  (3.1)  by  defining  the  matrices  As  €  Ckxk  and  S  €  C^n  k^xk  via  the 
formulae 

(3.12)  As  =  R*nQ\  and  S  =  R*n(R*n)~\ 

respectively.  □ 

Remark  3.  While  the  definition  (3.5)  serves  its  purpose  within  the  proof  of  Theorem  3,  it  is 
somewhat  misleading.  Indeed,  it  is  more  reasonable  to  define  T  as  a  solution  of  the  equation 

(3.13)  ll-RnT1  —  R12II2  <  &k+i(A) y/l  +  A;(n  —  k). 

When  the  solution  is  non-unique  we  chose  a  solution  that  minimizes  ||T||f.  PVom  the  numerical 
point  of  view,  the  definition  (3.13)  is  much  preferable  to  (3.5)  since  it  is  almost  invariably  the  case 
that  ilii  is  highly  ill-conditioned,  if  not  outright  singular. 

Introducing  the  notation 

/ 


(3.14) 


Acs  =  Pl 


As  e  C 


nxk 


and  Ars  =  As  [/|  T]  Pr  €  C 


1  kxm 


we  observe  that  under  the  conditions  of  Theorem  3,  the  factorization  (3.1)  can  be  rewritten  in  the 
forms 

(3.15)  A  =  ACs[/|T]P£  +  X, 
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and 


(3.16) 


A  =  Pl 


Ars  +  X. 


The  matrix  Acs  consists  of  k  of  the  columns  of  A,  while  Ars  consists  of  k  of  the  rows.  We  refer  to 
As  as  the  skeleton  of  A,  and  to  Acs  and  Ars  as  the  column  and  row  skeletons,  respectively. 


Remark  4.  While  Theorem  3  guarantees  the  existence  of  a  well-conditioned  factorization  of  the 
form  (3.1),  it  says  nothing  about  the  cost  of  obtaining  such  a  factorization.  However,  it  follows 
immediately  from  Theorem  2  that  a  factorization  (3.1)  with  the  matrices  S,  T,  and  X  satisfying 
the  weaker  bounds 

(3.17)  ||S||2  <  s/mk{m  -  k),  and  ||T||2  <  y/n A(n  -  *), 
and,  with  l  =  min(m,n), 

(3.18)  ||.X||2  <  y/l+im -  k)crk+i(A), 
can  be  constructed  at  the  cost  0{mnl). 


Observation  1.  The  relations  (3.1),  (3.15),  (3.16)  have  simple  geometric  interpretations.  Specif¬ 
ically,  (3.15)  asserts  that  for  a  matrix  A  of  rank  k,  it  is  possible  to  select  k  columns  that  form  a 
well-conditioned  basis  of  the  entire  column  space.  Let  j\,  ■  ■  •  ,jk  €  {1, . . .  ,n}  denote  the  indices  of 
those  columns  and  let  Xk  =  sparse*,.  ..,eJk)  C  Cn  (thus,  Xk  is  the  space  of  vectors  whose  only 
non-zero  coordinates  are  x,-, , . .  .  ,xJfc).  According  to  Theorem  3,  there  exists  an  operator 

(3.19)  Proj  :  C"  -*•  Xk, 


defined  by  the  formula 

(3.20) 


Proj  =  Pr 


"  I 

T 

1 

3 

Pt 


R) 


such  that  the  diagram 


(3.21) 


is  commutative.  Here,  A'cs  is  the  mxn  matrix  formed  by  setting  all  columns  of  A  except  ji, . . . ,  jk 
to  zero.  Furthermore,  o\  (Proj )/ak (Proj)  <  y/l  +  k(n  -  k).  Similarly,  equation  (3.16)  asserts  the 
existence  of  k  rows,  say  with  indices  ii,...,ik  G  {1, •  •  • , rn},  that  form  a  well-conditioned  basis  for 
the  entire  row-space.  Setting  Yk  =  span(eil, . . .  ,e,  J  C  Cm,  there  exists  an  operator 

(3.22)  Eval  :  Yk  ->  C™, 
defined  by 

(3.23)  Eval  =  FL  J  I  0  J 
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such  that  the  diagram 
(3.24) 


A 


Cm 


is  commutative.  Here,  i4JiS  is  the  in  x  n  matrix  formed  by  setting  all  rows  of  A  except  to 

zero.  Furthermore,  <j\  (Eval) /a/,  (Eval)  <  */l  +  k(m  —  k).  Filially,  the  geometric  interpretation  of 
(3.1)  is  the  combination  of  the  diagrams  (3.21)  and  (3.24), 


(3.25) 


C" 

I'roj 

av 


cm 

Eval 

n 


|  Bv 


Here,  A'§  is  the  m  x  n  matrix  formed  by  setting  all  entries  of  A,  except  those  at  the  intersection  of 
the  rows  »i, . . . ,t*  with  the  columns  to  zero. 

As  a  comparison,  we  consider  the  diagram 

(3.26)  Cn  Cm 

‘‘J  h 


obtained  when  the  SVD  is  used  to  compress  the  matrix  A  €  Cmxn.  Here,  £>*  is  the  k  x  k  diagonal 
matrix  formed  by  the  k  largest  singular  values  of  A,  and  V*  and  17*  are  column  matrices  containing 
the  corresponding  right  and  left  singular  vectors,  respectively.  The  factorization  (3.25)  has  the 
advantage  over  (3.26)  that  the  mappings  Proj  and  Eval  leave  k  of  the  coordinates  invariant.  This 
is  gained  at  the  price  of  non-orthonormality  of  these  mappings. 


4.  Numerical  apparatus 

In  this  section,  we  present  a  simple  ami  reasonably  efficient  procedure  for  computing  the  factor¬ 
ization  (3.1).  It.  has  been  extensively  tested  and  consistently  produces  factorizations  that  satisfy 
the  bounds  (3.17).  While  there  exist  matrices  for  which  this  simple  approach  will  not  work  well, 
they  appear  to  be  exceedingly  rare. 

Given  an  m  x  n  matrix  A,  the  first  step  (out  of  four)  is  to  apply  the  pivoted  Gram-Schmidt 
process  to  its  columns.  The  process  is  halted  when  the  column  space  has  been  exhausted  to  a 
preset  accuracy  e,  leaving  a  factorization 

(4.1)  AP,(=Q[i?H|il12], 

where  Pr  €  C"xn  is  a  permutation  matrix,  Q  6  Crnxk  has  orthonormal  columns,  Ru  €  Ckxk  is 
upper  triangular,  and  Rvj  € 

The  second  step  is  to  find  a  matrix  T  €  Ckx(n~k)  that  solves  the  equation 

(4.2)  RnT  =  R12 


to  within  accuracy  e.  When  R\  \  is  ill-conditioned,  there  is  a  large  set  of  solutions;  we  pick  one  for 
which  ||T||f  is  small. 

Letting  .Acs  €  Cmxk  denote  the  matrix  formed  by  the  first  k  columns  of  AFfo.,  we  now  have  a 
factorization 

(4.3)  A  =  Acs[I\T]P&. 

The  third  and  the  fourth  steps  are  entirely  analogous  to  the  first  and  the  second,  but  are  concerned 
with  finding  k  rows  of  Acs  that,  form  a  basis  for  its  row-space.  They  result  in  a  factorization 

(4.4)  -Acs  =  Pp  -g-  -As- 

The  desired  factorization  is  now  obtained  by  inserting  (4.4)  into  (4.3): 

(4.5)  A  =  Pl  ;4s[/|T]/$. 

For  this  technique  to  be  successful,  it  is  crucially  important  that  the  Gram-Schmidt  factorization 
be  performed  accurately.  Modified  Gram-Schmidt  or  the  method  using  Householder  reflectors  are 
not  accurate  enough.  Instead,  we  use  a  technique  that  is  based  on  modified  Gram-Schmidt,  but 
that  at  each  step  re-orthogonalizes  the  vector  chosen  to  add  to  the  basis  before  adding  it.  In  exact 
arithmetic,  this  step  would  be  superfluous,  but  in  the  presence  of  round-off  error  it  greatly  increases 
the  quality  of  the  factorization  generated,  see  e.g.  [6]. 

5.  Application:  An  accelerated  direct  solver  for  integral  equations 

In  this  section  we  use  the  matrix  compression  technique  presented  in  Section  3  to  construct 
an  accelerated  direct  solver  for  boundary  integral  equations  with  non-oscillatory  kernels.  Upon 
discretization,  such  equations  lead  to  dense  systems  of  linear  equations,  and  iterative  methods 
combined  with  fast  matrix-vector  multiplication  techniques  are  commonly  used  to  obtain  the  so¬ 
lution.  Many  such  fast  multiplication  techniques  take  advantage  of  the  fact  that  the  off-diagonal 
blocks  of  the  discrete  system  typically  have  low  rank.  Employing  the  matrix  compression  techniques 
presented  in  Section  3,  we  use  this  low-rank  property  to  accelerate  direct,  rather  than  iterative, 
solution  techniques.  The  method  uses  no  machinery  beyond  what  is  described  in  Section  3  and  is 
applicable  to  most  integral  equations  involving  non-oscillatory  kernels. 

For  concreteness,  we  consider  the  equation 

(5.1)  u{x)  +  J  K(x,y)u(y)dy  -  f(x),  for  x  €  T, 

where  T  is  some  contour  and  K{x,y)  is  a  non-oscillatory  kernel.  The  function  u  represents  an 
unknown  “charge”  distribution  on  T  that  is  to  be  determined  from  the  given  function  /.  The 
method  that  we  present  works  for  almost  any  contour  but  for  simplicity,  we  will  assume  that  the 

contour  consists  of  p  disjoint  pieces,  F  =  TH - Up,  where  all  pieces  have  similar  size  (an  example 

is  given  in  Fig.  3).  In  fact,  to  simplify  the  formulas,  we  will  for  the  most  part  set  p  =  3. 
Discretizing  each  contour  T;  using  n  points,  the  equation  (5.1)  takes  the  form 
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(a)  (b)  (c) 


FIGURE  1.  Zeros  axe  introduced  into  the  matrix  in  three  steps:  (a)  interaction 
between  Ti  and  the  other  contours  is  compressed,  (b)  interaction  with  Tj  is  com¬ 
pressed,  (c)  interaction  with  T3  is  compressed.  The  small  black  blocks  are  of  size 
k  x  k  and  consist  of  entries  that  have  not  been  changed  beyond  permutations,  grey 
blocks  refer  to  updated  parts  and  white  blocks  are  all  zero  entries. 

where  ti W  €  Cn  and  /W  G  Cn  are  discrete  representations  of  the  unknown  boundary  charge 
distribution  and  the  right  hand  side  associated  with  T,-,  and  €  Cn*n  is  a  dense  matrix 

representing  the  evaluation  of  a  potential  on  Ti  caused  by  a  charge  distribution  on  Tj. 

The  interaction  between  Tj  and  the  rest  of  the  contour  is  governed  by  the  matrices 

(5.3)  ff(1)  =  [M(l-2>|M(1’3)]  GClx2n,  and  7(1)  =  '^WT  €  C2nxw. 

For  non-oscillatory  kernels,  these  matrices  are  typically  highly  rank-deficient.  We  let  k  denote  an 
upper  bound  on  their  ranks  (to  within  some  preset  level  of  accuracy  e).  By  virtue  of  (3.16),  we 
know  that  there  exist  k  rows  of  which  form  a  well-conditioned  basis  for  all  the  n  rows.  In 
other  words,  there  exists  a  well-conditioned  n  x  n  matrix  (see  Remark  6)  such  that 

(5.4)  =  [jSgflJ  +  0(e), 

I  Z  l 

where  ffjg  is  a  kx2n  matrix  formed  by  k  of  the  rows  of  and  Z  is  the  (n—k)x  2  n  zero  matrix. 
There  similarly  exist  an  n  x  n  matrix  such  that 

(5.5)  F<1)r(1)=  [V^IZ*]  +0(e), 

where  V^g  is  a  2  nxk  matrix  formed  by  k  of  the  columns  of  V^.  For  simplicity,  we  will  henceforth 
assume  that  the  off-diagonal  blocks  have  exact  rank  at  most  k  and  ignore  the  error  terms. 

The  relations  (5.4)  and  (5.5)  imply  that  by  restructuring  equation  (5.2)  as  follows, 


we  introduce  large  blocks  of  zeros  in  the  matrix,  as  shown  in  Figure  1(a). 

Next,  we  compress  the  interaction  between  T2  and  the  rest  of  the  contour  to  obtain  the  matrix 
structure  shown  in  Fig.  1(b).  Repeating  the  process  with  Ta,  we  obtain  the  final  structure  shown 
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Figure  2.  In  order  to  determine  the  R W  and  that  compress  the  interaction 
between  T,-  (shown  in  bold)  and  the  remaining  contours,  it  is  sufficient  to  consider 
only  the  interactions  between  the  contours  drawn  with  a  solid  line  in  (b). 


in  Fig.  1(c).  At  this  point,  we  have  constructed  matrices  R^  and  Z,W  and  formed  the  new  system 


lWmWrW 

LWmWrW 

(RWrv1)  i 

LW /(!)  1 

(5-7) 

WJWVWT 

LWmWrW 

lwmwrw 

(R^)-V^ 

= 

ifi)  f&) 

_  lFTm&ZTrF) 

l^mwrW  J 

.  (JH3J)-ltii*>  J 

whose  matrix  is  shown  in  Figure  1(c).  We  emphasize  that  the  k  x  k  non-zero  parts  of  the  off- 
diagonal  blocks  are  submatrices  of  the  original  n  x  n  off-diagonal  blocks.  The  parts  of  the  matrix 
that  are  shown  as  grey  in  the  figure  represent  interactions  that  are  internal  to  each  contour.  These 
n  —  k  degrees  of  freedom  per  contour  can  be  eliminated  by  performing  a  local,  0(n 3),  operation  for 
each  contour.  This  leaves  a  dense  system  of  3  x  3  blocks,  each  of  size  k  x  k.  Thus,  we  have  reduced 
the  problem  size  by  a  factor  of  n/k. 

Remark  5.  For  the  algorithm  presented  above,  the  compression  of  the  interaction  between  a  fixed 
contour  and  its  p  —  1  fellows  is  quite  costly  since  it  requires  the  construction  and  compression  of  the 
large  matrices  H®  €  Cnx^p_1^n  and  V®  €  C^p-1^xn.  In  the  numerical  examples  presented  below, 
this  step  is  avoided  by  constructing  matrices  L W  and  R^  that  satisfy  (5.4)  and  (5.5)  through  an 
entirely  local  procedure.  We  illustrate  how  this  is  done  by  considering  the  contours  in  Fig.  2(a) 
and  supposing  that  we  want  to  find  the  transforms  that  compress  the  interaction  of  the  contour  T* 
(drawn  with  a  bold  line)  with  the  remaining  ones.  This  can  be  done  by  compressing  the  interaction 
between  T*  and  an  artificial  contour  Taxtif  that  surrounds  I\  (as  shown  in  Fig.  2(b))  combined  with 
the  parts  of  the  other  contours  that  penetrate  it.  This  procedure  works  for  any  potential  problem 
for  which  the  Green’s  identities  hold.  The  computational  cost  for  one  compression  is  0(kn2)  rather 
than  the  0(pkn 2)  cost  for  constructing  and  compressing  the  entire  and  V^. 

To  sum  up:  The  accelerated  solver  consists  of  four  steps.  For  a  problem  involving  p  contours, 
each  of  which  is  discretized  using  n  nodes  and  having  off-diagonal  blocks  of  rank  at  most  k,  they 
are: 

(1)  The  off-diagonal  blocks  are  skeletonized  and  the  diagonal  n  x  n  blocks  are  updated  at  a 
cost  of  Oipkn 2)  using  the  technique  described  in  Remark  5. 
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« 


4 
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(2)  The  n  —  k  degrees  of  freedom  that  represent  internal  interactions  for  each  contour  axe 
eliminated  at  a  cost  of  0(pn 3). 

(3)  The  reduced  kp  x  kp  system  is  solved  at  a  cost  of  0(fc3p3). 

(4)  The  solution  of  the  original  system  is  reconstructed  from  the  solution  of  the  reduced  problem 
through  p  local  operations  at  a  cost  of  0(pn2). 

The  third  step  is  typically  the  most  expensive  one  with  an  asymptotic  cost  of  f(comP)  ~  cfc3p3.  The 
cost  of  a  solution  of  the  uncompressed  equations  is  t(uncomP)  ~  cn3p3.  Consequently; 


Speed-up  = 


^(uncomp) 

T(  con,p) 


Remark  6.  The  existence  of  the  matrices  L W  and  R^  are  direct  consequences  of  (3.16)  and 
(3.15),  respectively.  Specifically,  substituting  H W  for  A  in  (3.16),  we  obtain 


(5.8) 


PtH^  = 


I_ 

s 


jjW 

■"rs* 


where  is  the  k  x  2n  matrix  consisting  of  the  top  k  rows  of  The  relation  (5.4)  now 

follows  from  (5.8)  by  defining 


(5.9) 


L™  = 


'I  O' 
-S  I 


PI 


We  note  that  the  largest  and  smallest  singular  values  of  satisfy 

<n(£(1))<(l  +  l|S|l?>)1/!. 
a„(£(1,)>(l  +  l|S||?!)'1/2- 


Thus  cond(T^))  <  1  +  ||S,||22,  which  is  of  moderate  size  according  to  Theorem  3.  The  matrix 
is  similarly  constructed  by  forming  the  column  skeleton  of  V^. 


Remark  7.  Equations  (5.4)  and  (5.5)  have  simple  heuristic  interpretations:  Equation  (5.4)  says 
that  it  is  possible  to  choose  k  points  on  the  contour  Ti  in  such  a  way  that  when  a  field  generated  by 
charge  distributions  on  the  rest  of  the  contour  is  known  at  those  points,  it  is  possible  to  extrapolate 
the  field  at  the  remaining  points  on  Ti  from  those  values.  Equation  (5.5)  says  that  it  is  possible  to 
choose  k  points  on  Ti  in  such  a  way  that  any  field  on  the  rest  of  the  contour  generated  by  charges 
on  Ti,  can  be  replicated  by  placing  charges  only  on  those  k  points. 

Remark  8.  It  is  sometimes  advantageous  to  choose  the  same  k  points  when  constructing  the 
skeletons  of  and  This  can  be  achieved  by  compressing  the  two  matrices  jointly,  for 

instance  by  forming  the  row  skeleton  of  [H^|  (FW)*].  In  this  case  =  (R^)*.  When  this  is 
done,  the  compression  ratio  deteriorates  since  the  singular  values  of  [R^|  (yW)*]  decay  slower 
than  those  of  either  H W  or  VW,  as  is  seen  by  comparing  Figures  4  and  5. 

Remark  9.  When  the  solution  of  equation  (5.2)  is  sought  for  multiple  right-hand  sides,  the  cost 
of  the  first  solve  is  0(mnk).  Subsequent  solves  can  be  preformed  using  0(p2k 2  +pn2)  operations 
rather  than  0(p2n2)  for  an  uncompressed  solver. 
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Figure  3.  The  contours  used  for  the  numerical  calculations  with  p  =  128.  Picture 
(a)  shows  the  full  contour  and  a  box  (which  is  not  part  of  the  contour)  that  indicates 
the  location  of  the  close-up  shown  in  (b). 


Remark  10.  The  direct  solver  that  we  have  presented  has  a  computational  complexity  that  scales 
cubically  with  the  problem  size  N  and  is  thus  not  a  “fast”  algorithm.  However,  by  applying  the 
techniques  presented  recursively,  it  is  possible  to  reduce  the  asymptotic  complexity  to  0(1V3'2), 
and  possibly  even  O(NlogN).  This  is  a  topic  of  current  research. 

6.  Numerical  results 

The  algorithm  described  in  Section  5  has  been  computationally  tested  on  the  second  kind  integral 
equation  obtained  by  discretizing  an  exterior  Dirichlet  boundary  value  problem  using  the  double 
layer  kernel.  The  contours  used  consisted  of  a  number  of  jagged  circles  arranged  in  a  skewed  square 
as  shown  in  Fig.  3.  The  number  of  contours  p  ranged  from  8  to  128.  For  this  problem,  n  =  200 
points  per  contour  were  required  to  obtain  a  relative  accuracy  of  e  =  10-6.  We  found  that  to  this 
level  of  accuracy,  no  H W  or  had  rank  exceeding  k  =  50.  As  an  example,  we  show  in  Fig.  4 
the  singular  values  of  the  matrices  if  ^  and  V^1-*  representing  interactions  between  the  highlighted 
contour  in  Fig.  2(a)  and  the  remaining  ones. 

The  algorithm  described  in  Section  5  was  implemented  in  FORTRAN  and  run  on  a  2.8GHz 
Pentium  IV  desktop  PC  with  512Mb  RAM.  The  CPU  times  for  a  range  of  different  problem  sizes 
are  presented  in  Table  1.  The  data  presented  supports  the  following  claims  for  the  compressed 
solver: 

•  For  large  problems,  the  CPU  time  speed-up  approaches  the  estimated  factor  of  (n/fc)3  =  64. 

•  The  reduced  memory  requirement  make  large  problems  amenable  to  direct  solution. 

Remark  11.  In  the  interest  of  simplicity,  we  forced  the  program  to  use  the  same  compression  ratio 
k/n  for  each  contour.  In  general,  it  detects  the  required  interaction  rank  of  each  contour  as  its 
interaction  matrices  are  being  compressed  and  uses  different  ranks  for  each  contour. 
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Figure  4.  Plots  of  the  singular  values  of  (a)  V®  and  (b)  for  a  discretization 
of  the  double  layer  kernel  associated  with  the  Laplace  operator  on  the  nine  contours 
depicted  in  Fig.  2(a).  In  the  example  shown,  the  contours  were  discretized  using 
n  =  200  points,  giving  a  relative  discretization  error  of  about  10-6.  The  plots  show 
that  to  that  level  of  accuracy,  the  matrices  VM  e  C1600x200  and  G  <0200x1600 
have  numerical  rank  less  than  k  =  50  (to  accuracy  10-6). 


p 

^(uncomp) 

j(comp) 

.(comp) 

linit 

.(comp) 

solve 

Error 

8 

5.6 

2.0  (4.6) 

1.6  (4.1) 

0.05 

8.1  •  10“r(1.4  •  10-  '1) 

16 

50 

4.1  (16.4) 

3.1  (15.5) 

0.4 

2.9  •  10“6(2.8  •  10“7) 

32 

451 

13.0  (72.1) 

6.4  (65.3) 

5.5 

4.4  •  10-6(4.4  •  10“7) 

64 

3700 

65  (270) 

14  (220) 

48 

— 

128 

30000 

480  (1400) 

31  (960) 

440 

— 

Table  1.  CPU  times  in  seconds  for  solving  (5.2).  p  is  the  number  of  contours. 
j(uncomp)  is  the  CPU  time  required  to  solve  the  uncompressed  equations;  the  numbers 
in  italics  are  estimated  since  these  problems  did  not  fit  in  RAM.  t<comP)  is  the 
CPU  time  to  solve  the  equations  using  the  compression  method;  this  time  is  split 
between  ,  the  time  to  compress  the  equations,  and  ,  the  time  to  solve 

the  reduced  system  of  equations.  The  error  is  the  relative  error  incurred  by  the 
compression  measured  in  the  maximum  norm  when  the  right  hand  side  is  a  vector 
of  ones.  Throughout  the  table,  the  numbers  in  parenthesis  refer  to  numbers  obtained 
when  the  technique  of  Remark  5  is  not  used. 


7.  Conclusions 

We  have  described  a  “compression”  scheme  for  low-rank  matrices.  For  a  matrix  A  of  dimen¬ 
sionality  m  x  n  and  rank  k,  the  factorization  can  be  applied  to  an  arbitrary  vector  for  the  cost  of 
( n  +  m-k)-k  operations,  after  a  significant  initial  factorization  cost;  this  is  marginally  faster  than 
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FIGURE  5.  Plot  of  the  singular  values  of  X®  =  [H^  |  (vM)*]  where  and  V® 
are  as  in  Figure  4.  The  numerical  rank  of  X W  is  approximately  80,  which  is  larger 
than  the  individual  ranks  of  H W  and  V^. 


the  cost  (n  +  m)  •  A:  produced  by  the  SVD.  The  factorization  cost  is  roughly  the  same  as  that  for 
the  rank-revealing  QR  decomposition  of  A. 

A  more  important  advantage  of  the  proposed  decomposition  is  the  fact  that  it  expresses  all  of 
the  mhimws  of  A  as  linear  combinations  of  k  appropriately  selected  columns  of  A,  and  all  of  the 
rows  of  A  as  linear  combinations  of  k  appropriately  selected  rows  of  A.  Since  each  of  the  basis 
vectors  (both  row  and  column)  produced  by  the  SVD  (or  any  other  classical  factorizations)  is  a 
linear  combination  of  all  rows  (columns)  of  A,  the  decomposition  we  propose  is  considerably  easier 
to  mfl.nipnla.tft;  we  illustrate  this  point  by  constructing  an  accelerated  scheme  for  the  direct  solution 
of  integral  equations  of  potential  theory  in  the  plane. 

A  related  advantage  of  the  proposed  decomposition  is  the  fact  that  one  frequently  encounters 
collections  of  matrices  such  that  the  same  selection  of  rows  and  columns  can  be  used  for  each  matrix 
to  span  its  row  and  column  space  (in  other  words,  there  exist  fixed  Pt,  a-nd  Pr  such  that  each  matrix 
in  the  collection  has  a  decomposition  (3.1)  with  small  matrices  S  and  T).  Once  one  matrix  in  such 
a  collection  has  been  factorized,  the  decomposition  of  the  remaining  ones  is  considerably  simplified 
since  the  skeleton  of  the  first  can  be  reused.  If  it  should  happen  that  the  skeleton  of  the  first  matrix 
that  was  decomposed  is  not  a  good  choice  for  some  other  matrix,  this  is  easily  detected  (since  then 
no  small  matrices  S  and  T  can  be  computed)  and  the  global  skeleton  can  be  extended  as  necessary. 

We  have  constructed  several  other  numerical  procedures  using  the  approach  described  in  this 
paper.  In  particular,  a  code  has  been  designed  for  the  (reasonably)  rapid  solution  of  scattering  prob¬ 
lems  in  the  plane  based  on  the  direct  (as  opposed  to  iterative)  solution  of  the  Lippman-Schwinger 
equation;  the  scheme  utilizes  the  same  idea  as  that  used  in  [4],  and  has  the  same  asymptotic  CPU 
time  estimate  0(AI3/2)  for  a  square  region  discretized  into  N  nodes.  However,  the  CPU  times 
obtained  by  us  are  a  significant  improvement  on  these  reported  in  [4];  the  paper  reporting  this 
work  is  in  preparation. 

It  also  appears  to  be  possible  to  utilize  the  techniques  of  this  paper  to  construct  an  order 
0(N  log  N)  (or  possibly  even  order  order  0(N)  (!))  scheme  for  the  solution  of  elliptic  PDEs  in 
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both  two  and  three  dimensions,  provided  that  the  associated  Green’s  function  is  not  oscillatory. 
This  work  is  in  progress,  and  if  successful  will  be  reported  at  a  later  date. 
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Abstract 

We  describe  an  algorithm  for  the  direct  solution  of  systems  of  linear  algebraic  equations  associated  with  the  discret- 
ization  of  boundary  integral  equations  with  non-oscillatory  kernels  in  two  dimensions.  The  algorithm  is  “fast”  in  the 
sense  that  its  asymptotic  complexity  is  0(n  log*  n),  where  n  is  the  number  of  nodes  in  the  discretization,  and  k  depends 
on  the  kernel  and  the  geometry  of  the  contour  (k  =  1  or  2).  Unlike  previous  fast  techniques  based  on  iterative  solvers, 
the  present  algorithm  directly  constructs  a  compressed  factorization  of  the  inverse  of  the  matrix;  thus  it  is  suitable  for 
problems  involving  relatively  ill-conditioned  matrices,  and  is  particularly  efficient  in  situations  involving  multiple  right 
hand  sides.  The  performance  of  the  scheme  is  illustrated  with  several  numerical  examples. 

©  2004  Elsevier  Inc.  All  rights  reserved. 


1.  Introduction 

The  boundary  value  problems  of  classical  potential  theory  are  ubiquitous  in  engineering  and  physics. 
Most  such  problems  can  be  reduced  to  boundary  integral  equations  which  are,  from  a  mathematical  point 
of  view,  more  tractable  than  the  original  differential  equations.  Although  the  mathematical  benefits  of  such 
reformulations  were  realized  and  exploited  in  the  19th  century,  until  recently  boundary  integral  equations 
were  rarely  used  as  numerical  tools,  since  most  integral  operators  upon  discretization  turn  into  dense  matri¬ 
ces.  In  the  1980s,  the  cost  of  applying  dense  matrices  resulting  from  potential  theory  to  arbitrary  vectors 
was  greatly  reduced  by  the  development  of  “fast”  algorithms  (Fast  Multipole  Methods,  panel  clustering, 
wavelets,  etc.).  Combining  fast  matrix-vector  multiplication  techniques  with  iterative  schemes  for  the  solu¬ 
tion  of  large-scale  systems  of  linear  algebraic  equations,  it  became  possible  to  solve  well-conditioned 
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boundary  integral  equations  of  potential  theory  in  0(n)  operations,  where  n  is  the  number  of  unknowns. 
Today,  such  combinations  arc  in  many  environments  the  fastest  and  most  accurate  numerical  solution  tech¬ 
niques  available.  Iterative  linear  solvers  have  certain  drawbacks  though;  we  briefly  discuss  these  below. 

(1)  The  number  of  iterations  required  by  an  iterative  solver  is  sensitive  to  the  spectral  properties  of  the 
matrix  of  the  system  to  be  solved;  for  sufficiently  ill-conditioned  problems,  the  number  of  iterations 
is  proportional  to  n.  Since  each  iteration  (with  FMM  acceleration)  requires  0(n)  operations,  the  total 
operation  count  is  then  proportional  to  n2.  While  this  is  still  better  than  the  0(n3)  estimate  associated 
with  direct  solvers,  in  many  situations  0(n2)  is  not  acceptable. 

(2)  When  one  needs  to  solve  a  collection  of  problems  involving  a  single  matrix  and  multiple  right-hand 
sides,  the  CPU  time  requirements  of  most  iterative  algorithms  are  close  to  the  time  required  to  solve 
one  problem  multiplied  by  the  number  of  problems  to  be  solved.  With  most  direct  solvers,  the  situation 
is  different;  once  the  matrix  has  been  inverted  (or  factored),  applying  its  inverse  to  each  additional 
right-hand  side  is  very  inexpensive. 

(3)  When  a  collection  of  linear  systems  has  to  be  solved  whose  matrices  are  in  some  sense  “close”  to  each 
other,  iterative  algorithms  derive  little  (if  any)  advantage  from  the  closeness  of  the  matrices. 

(4)  Most  direct  schemes  for  the  solution  of  linear  systems  are  closely  related  to  efficient  algorithms  for  the 
construction  of  their  Singular  Value  Decompositions  and  certain  other  matrix  factorizations  (L-U, 
Q-R,  etc.).  The  simplest  such  scheme  is  probably  the  inverse  power  method  with  shifts  (see,  for  exam¬ 
ple,  [6]),  which  converts  any  algorithm  for  the  solution  of  a  linear  system  into  an  algorithm  for  the 
determination  of  a  prescribed  singular  value.  Iterative  techniques  do  not  provide  such  a  capability, 
except  via  the  so-called  Lanczos  schemes,  which  tend  to  require  a  large  number  of  iterations  (see, 
for  example,  [14]). 

The  subject  of  this  paper  is  a  numerical  technique  that  is  intended  to  overcome  these  shortcomings  by 
directly  producing  a  compressed  (“data-sparse”)  factorization  of  the  inverse  of  the  matrix.  When  applied 
to  contour  integral  equations  of  potential  theory  whose  kernels  are  non-oscillatory,  the  asymptotic  com¬ 
plexity  of  the  solver  is  typically  O (n  log*  n),  where  k  depends  on  the  geometry  and  the  kernel  (tc  =  1  or 
2).  When  applied  to  problems  involving  oscillatory  kernels,  the  asymptotic  complexity  deteriorates  as 
the  wavenumber  increases  but  the  scheme  remains  viable  for  objects  up  to  a  few  hundred  wavelengths  in 
size.  The  factorization  technique  described  in  this  paper  is  a  multilevel  extension  of  the  compression  tech¬ 
nique  described  in  [3],  The  machinery  underlying  these  techniques  applies  generally  to  matrices  with  rank- 
deficient  off-diagonal  submatrices;  contour  integral  equations  have  been  chosen  by  the  authors  simply  as 
the  most  straightforward  application. 

It  is  not  the  purpose  of  this  paper  to  provide  an  exhaustive  survey  of  the  literature  on  the  subject  we  are 
addressing.  A  number  of  researchers  have  observed  that  matrices  with  rank-deficient  off-diagonal  blocks 
admit  “fast”  factorizations  (see  [8,9]);  others  have  constructed  “fast”  algorithms  in  various  environments 
(see  [1,2,4,5,12])  where  the  operators  in  question  posses  rank-deficient  off-diagonal  blocks,  without  using 
this  property  explicitly.  However,  we  observe  that  the  algorithm  of  this  paper  is  closely  related  to  the 
scheme  described  in  [13].  In  fact,  our  algorithm  could  be  viewed  as  a  modification  of  the  algorithm  of 
[13]  that  replaces  “elongated”  objects  in  two  or  three  dimensions  with  “curves”,  extends  the  class  of  kernels 
addressed  by  [13],  and  introduces  modifications  in  the  scheme  of  [13]  that  are  necessary  for  this  extension  to 
work. 

The  paper  is  organized  as  follows:  In  Section  2  we  introduce  our  notation  and  list  certain  facts  about 
compression  of  rank-deficient  matrices.  In  Section  3  we  demonstrate  that  the  inverse  of  a  matrix  with 
rank-deficient  off-diagonal  blocks  possesses  a  data-sparse  hierarchical  factorization.  In  Section  4  we  present 
a  generic  numerical  technique  for  constructing  the  factorization  described  in  Section  3.  In  Section  5  we 
show  how  the  generic  numerical  technique  presented  in  Section  4  can  be  improved  further  when  applied 
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to  contour  integral  equations.  In  Section  6  we  illustrate  through  numerical  examples  the  efficiency  of  the 
technique  presented  in  Section  5  when  applied  to  a  number  of  different  kernels  and  contours.  In  Section 
7  we  summarize  our  findings  and  discuss  possible  extensions  and  generalizations. 


2.  Preliminaries 


2.1.  Notation 


Throughout  the  paper,  we  use  upper  case  letters  for  matrices  and  lower  case  letters  for  vectors  and  sca¬ 
lars.  The  canonical  unit  vectors  in  C"  are  denoted  by  ej.  Given  a  matrix  X  e  Cmx",  we  let 
X*  denote  its  adjoint  (the  complex  conjugate  transpose), 
ok(X)  denote  its  A'th  singular  value, 

||Aj|2  denote  its  f  operator  norm, 

IIA'IIf  denote  its  Frobenius  norm,  and 
xj  €  Cmxl  denote  its  yth  column. 


Given  matrices  A,  B,  C  and  D  we  let 


\AB), 


FA 

C 


and 


A 

C 


denote  larger  matrices  obtained  by  stringing  the  blocks  A,  B,  C  and  D  together. 
Definition  1.  (Permutation  vectors)  Given  a  positive  integer  n,  we  define 
J„  =  the  set  of  permutations  of  the  integers  {1, . . . 

Given  two  integers  k  and  tt  such  that  1  <  k  <  n,  we  define 
J*  =  the  set  of  subsets  of  size  kof  elements  of  J„. 

n 

In  other  words,  if  J  €  J*,  then  J  is  a  vector  of  integers 

J  =  \j\i  •  •  •  iA]> 

where  1  <  ^  «  and  all  j{ s  are  different. 


(2.1) 


(2.2) 

(2.3) 

(2.4) 


Definition  2.  (Submatrix)  When  we  use  the  term  “submatrix”  we  do  not  insist  that  the  submatrix  must 
form  a  contiguous  block.  To  be  precise,  we  say  that  B  e  C*w  is  a  submatrix  of  A  €  C'”XB,  if  there  exist 
permutations  /  =  [it , . . . ,  »*]  e  J*  and  J  .  J,}  €  J'  such  that 

bPq  =  ai„j,,  forp=  l,...,k,  q  =  1,...,/.  (2.5) 

Definition  3.  (Neutered  rows  and  columns)  Let  A  be  a  matrix  consisting  of pxp  blocks, 


A  = 


„(U)  ... 


We  refer  to  the  submatrix  formed  by  all  blocks  on  the  zth  row  except  the  diagonal  one,  i.e. 
as  the  zth  neutered  row  of  blocks.  A  neutered  column  of  blocks  is  defined  analogously. 


(2.6) 


(2.7) 
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2.2 .  Compression  of  matrices 

In  this  section  we  state  a  theorem  on  matrix  compression  that  forms  the  foundation  of  the  matrix  fac¬ 
torization  technique  presented  later  in  this  paper.  Roughly  speaking,  the  theorem  asserts  that  given  a  ma¬ 
trix  A  of  rank  k,  it  is  possible  to  pick  k  of  its  columns  in  such  a  fashion  that  they  form  a  well-conditioned 
basis  for  the  remaining  columns.  It  was  first  reported  in  slightly  different  form  in  [7]. 

Theorem  1.  Given  an  arbitrary  matrix  A  €  Cmxn  and  an  integer  k  such  that  1  ^k<  min  (m,n),  there  exists  a 
(not  necessarily  unique)  matrix  T  €  and  a  permutation  J  =  \fu . .  .Jn]  £  such  that 

A2=AlT  +  E.  (2.8) 

Here,  A]  and  A2  are  matrices  formed  by  the  columns  of  A, 

A,  =  W . «„]  €  C-*, 

Al  =  K . 

the  elements  of  the  matrix  T  6  C*x(n“*)  satisfy 
\TtJ\  <  1,  fori  Ki<k,  l^j^n-k, 
and  the  matrix  E  €  Cmx{"~k)  satisfies  the  inequality 
||£||2  <  aM{A)s/TTk{n^k), 
where  Oj,+i(A)  is  the  ( k  +  1  )th  singular  value  of  A. 

Remark  4.  (Computational  complexity)  While  Theorem  1  asserts  the  theoretical  existence  of  a  matrix  T 
and  a  permutation  J  with  certain  properties,  it  does  not  address  the  question  of  how  to  determine  these 
numerically.  In  fact,  the  authors  are  not  aware  of  any  algorithm  that  finds  these  objects  in  polynomial  time. 
However,  in  [7]  an  algorithm  is  presented  that  finds  a  matrix  T  and  a  permutation  J  such  that  all  statements 
of  Theorem  1  still  hold,  except  that  (2.10)  and  (2.11)  are  replaced  by  the  weaker  inequalities 

|  Tu  |<  Vh,  for  1  <  /  ^  k,  1  ^j^n-k  (2-12) 

and 

||£||2  <  <Jk+\(A)\/l  Tnk{n  -  k).  (2.13) 

When  m  >  n,  the  computational  complexity  of  this  algorithm  is  typically  O (mnk),  the  same  as  for  the 
pivoted  gR-factorization.  In  rare  cases,  the  computational  complexity  may  be  somewhat  larger  but  it  never 
exceeds  O (mn2). 


(2.9) 

(2.10) 

(2.11) 


Observation  5.  (Column  compression)  When  applied  to  a  matrix  A  6C"”  of  rank  k,  Theorem  1  asserts 
that  there  exists  a  well-conditioned  column  operation  that  leaves  k  of  the  columns  of  A  unchanged  while 
mapping  the  remaining  n  -  k  columns  to  zero.  More  specifically,  let  us  define 


R  =  Pj 


7 

0 


(2.14) 


where  T  and  J  are  defined  in  Theorem  1  and  the  permutation  matrix  Pj  is  defined  by 


Pj  =  [«/,>•• -i<7 J 


€  C"x". 


Then 


(2.15) 
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AR  =  [4cs0]  €  Cmx", 

where  the  “column  skeleton”  /lcs>  is  formed  by  k  of  the  columns  of  A; 

Acs  =  h„...,aJt]€Cmxi. 

Moreover,  by  virtue  of  (2.10)  and  the  identity 

■  \I  T1  r,. 

'  "[«  /p 

it  is  clear  that 

||/f||F  <  y/n  +  k{n-k),  and  ||/r*||F  <  y/n  +  k(n-  k). 


(2.16) 

(2.17) 

(2.18) 


(2.19) 


Observation  6.  (Row  compression)  The  argument  of  Observation  5  can  equally  well  be  applied  to  the  rows 
of  a  matrix  A  G  Cmx"  of  rank  k.  Thus,  there  exists  a  matrix  L  G  Rmxm  such  that 


LA  = 


^4rs 

0 


e  cmx", 


where  the  “row  skeleton”  Ars  €  C*x"  is  formed  by  k  of  the  rows  of  A  and 


||Z,||F  <  \/m  +  k(m  -k)  and  \\L  ’Up  <  y/m  +  k(m-k). 


(2.20) 


(2.21) 


3.  Analytical  apparatus 


Consider  a  p  x  p  block  matrix 


A(ii) 


A  = 


A(pi)  . . .  AM 


(3.1) 


such  that  any  neutered  row  or  column  of  blocks  is  rank-deficient.  In  this  section  we  derive  compressed  fac¬ 
torizations  of  the  inverse  of  such  a  matrix.  Lemmas  2  and  3  provide  factorizations  for  the  case  p  =  2.  Obser¬ 
vation  8  extends  the  results  of  Lemma  3  to  a  general  p.  Observation  9  introduces  hierarchical  factorizations 
that  further  improve  the  efficiency. 

T-ftmma  2  below  asserts  that  for  a  given  2x2  block  matrix  with  rank-deficient  off-diagonal  blocks,  there 
exist  well-conditioned  row-  and  column-operations  that  (i)  introduce  zeros  in  the  off-diagonal  blocks  and 
(ii)  leave  the  remaining  elements  in  the  off-diagonal  blocks  untouched. 


Lemma  2.  Let  A  be  a  non-singular  matrix 


A<  I2> 
A™ 


^  j\(n+m)x(/»+/n) 


(3.2) 


where  A(n)  G  C"*\A{22)  G  Cmxm  and  the  off-diagonal  blocks  A('2)  G  CBX'V(2I)  G  CmXB  have  rank 
k  <  min(m,n).  Then  there  exist  matrices  R,L  €  CBXB  such  that 


I/:;.. 
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Xu  *.2  4's2)l 
*2,  *22  0 

f<2’>  0  a™ 


L  O' 

A02)- 

R  O' 

.0  /. 

/ 2 ')  a<22\ 

.0  7. 

(3.3) 


Here,  the  matrix  A{^  €  C*xm  consists  of  k  of  the  rows  of  A(n)  and  the  matrix  A^  £  Cmx*  consists  of  k  of  the 
columns  of  A(2X\  Moreover,  Xu  €  Ckxk,Xu  €  R*x(""*),*2i  €  IR(”"*)x*,*22  €  #  * ,x("  k),  and  the  matrices  R 
and  L  satisfy  (2.19)  and  (2.21),  respectively. 

Proof.  Due  to  Observations  5  and  6,  there  exist  matrices  R,L  £  Cnx"  such  that 


LA"*  = 


and  A(2,)R  =  oj , 


(3.4) 


where  and  ^cs*  are  submatrices  of  A(12)  and  A(2l),  respectively.  The  identity  (3.3)  now  follows  by 
partitioning 


R  =  [/?i  R2], 


where  L\  £  Ckxn,  L2  £  C^k)xn, 
where  7?,  £  Cnxk,  R2  €  Cx{n-k\ 


(3.5) 


and  setting 


Xu  =  LxA{n)R\  £  Ckxk, 

Xl2=LiA{u)R2£Ckx^n-k),  (36) 

X2,  =  L2A(,l)R,  £  c(a~k)xk, 

X22=L2A(l,)R2€C("-k)x("-k).  □ 

The  following  lemma  uses  the  results  of  Lemma  3  to  reduce  the  problem  of  factoring  the  inverse  of  the 
matrix  A  in  (3.2)  to  the  problem  of  factoring  the  inverse  of  the  smaller  matrix  A  in  (3.8). 

Lemma  3.  Let  A,X X12,  X21,  X22,  A^  andAcs  be  as  in  Lemma  2.  Provided  that  the  matrix  X22  in  (3.3)  is 
non-singular,  there  exist  matrices  B  £  Cnxk,  C  £  C*x"  and  D  £  Cnx"  such  that 


A-'  = 

B  01 

—1 

C  O' 

D  01 

o 

_ 1 

A 

.0  7. 

+ 

i - 

o 

o 

where 


A  = 


i(,,) 

A(l2) 

^RS 

MD 

LAcs 

A™ 

£(Jt+m)x(*+m) 


(3.7) 


(3.8) 


and 

A(n)  =  Xn  - *,2*22*21  6  Ctxt.  (3.9) 

Proof.  We  let  Lh  Li,  R\  and  R2  be  defined  by  (3.5).  Inverting  both  sides  of  Eq.  (3.3),  we  obtain  the  identity 


Remark  7.  (Symmetric  factorizations)  It  is  possible  to  force  the  factorization  (3.7)  to  be  symmetric  in  the 
sense  that  R  =  L*  (which  does  not  imply  that  C  =  B*  unless  A  itself  is  Hermitian).  To  this  end,  we  define  L 
and  Jr  as  the  matrix  and  index  vector  that  compress  the  rows  of  the  matrix  [A(1^A^  ]  e  R"x2m  (rather 
than  the  rows  of  A(n)  alone),  and  set  R  =  L*  and  Jc  =  Jr.  This  modification  typically  results  in  a  poorer 
compression  ratio  but  may  dramatically  improve  the  conditioning  of  the  transformation  matrices,  as 
discussed  in  Section  4.4. 

Observation  8.  (One-level  compression  of  a  block  matrix)  Consider  a  matrix 

-Aw  ...  aW 

A  =  :  :  €  €rxpn,  (3.13) 

^(pi)  . . .  ^(pp) 

where  A^  €  C**"  for  ij  =  1  We  assume  that  any  neutered  row  or  column  of  blocks  has  rank  at  most 
k.  Through  p  applications  of  Lemma  3,  it  is  possible  to  reduce  the  problem  of  inverting  A  to  the  problem  of 
inverting  the  smaller  matrix 

...  a"*' 

A=  :  :  (3.14) 

AW  ■■■A(PP\ 

where  A  {'J)  e  Cixi  for  ij  =  1, . .  .,p,  and  A^p  is  a  submatrix  of  A^  whenever  i  ^  j. 

More  specifically,  applying  Lemma  3  to  each  of  the  p  diagonal  blocks  of  A,  we  obtain  the  factorization 


C: 

i 
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’B , 

0  • 

0  ■ 

■c, 

0  • 

•  0  • 

‘A 

0  • 

••  O' 

A~'  = 

0 

B2 

0 

A~' 

0 

C2 

0 

+ 

0 

A 

0 

.  0 

0  • 

.  0 

0  • 

.  0 

0  • 

••  Dp. 

where  5,  €  Cnxk,  C ,  €  C*XB  and  D,  €  CBXB,  for  i  =  1 . p. 

The  single-level  matrix  compression  is  illustrated  graphically  in  Fig.  1 . 


(3.15) 


Observation  9.  (Hierarchical  compression  of  a  block  matrix)  Observation  8  reduces  the  problem  of  inver¬ 
sion  of  a  block  matrix  with  rank-deficient  neutered  rows  and  columns  to  the  problem  of  inversion  of  a 
block  matrix  with  smaller  blocks.  If  by  clustering  these  smaller  blocks,  we  can  create  a  matrix  with  off-diag¬ 
onal  rank-dcficiencics,  then  the  process  can  be  repeated  recursively  to  further  improve  the  compression. 

More  specifically,  let  us  change  notation  so  that  the  objects  labeled  A,  A  and  k  in  Observation  8  are  now 
labeled  AW,A(1)  and  ku  respectively.  Eq.  (3.15)  then  reads 


(^»)_1  =2?<I>(i(1))',C‘1>+D(,\  (3.16) 

where  B/l\  C*1*,  D(i)  are  block  diagonal  matrices  whose  p  diagonal  blocks  are  of  sizes  n  x  k\,  k\  x  n,  n  x  n, 
respectively.  Wc  then  cluster  the  blocks  of  the  matrix  A^}  to  form  a  matrix  A(2)  with  (p/2)  x  (p/2)  blocks  of 
size  2k\  x  2kj  and  apply  the  factorization  (3.16)  to  it,  thus  obtaining  a  telescoped  factorization 


(A^)~'=B^  2?<2>(i(2))~‘c'2>  +  A2> 


C(1>  +Z)(I). 


(3.17) 


Here,  arc  all  block  matrices  with  (p/2 )  x  (p/2)  blocks.  Letting  k2  denote  the  rank  of  the 

neutered  rows  and  columns  of  A(2),  the  blocks  of  A(2)  have  size  k2  x  k2,  while  ]P\  (P\  D(2)  are  diagonal 
block  matrices  with  diagonal  blocks  of  sizes  2 kxxk2,  k2x2kx  and  2^x2^,  respectively.  This  process 
can  be  continued  until  no  further  clustering  is  advantageous. 

The  multi-level  matrix  compression  is  illustrated  graphically  in  Fig.  2. 


Remark  10.  (Adjoint  of  the  inverse)  Obviously,  the  factorizations  (3.15)  and  (3.17)  provide  a  mechanism 
for  the  accelerated  application  of  both  A~]  and  [A~1]*. 

Remark  11.  (Block  sizes)  In  Observations  8  and  9,  it  was  assumed  that  all  blocks  within  one  of  the  matri¬ 
ces  A,  A,  A (1),  A(2\  . . have  the  same  size.  This  assumption  was  made  for  notational  convenience  only  and 
is  in  no  way  essential  to  the  results. 


Fig.  1.  A  3x3  matrix  [Aw^Ja;]  is  compressed  in  three  steps,  cf.  Observation  8.  In  step  j-  1,  2,  3,  the  single-block  compression  of 
Lemma  3  is  applied  to  compress  the  interaction  between  and  the  rest  of  the  matrix.  Black  blocks  represents  entries  that  have  not 
been  changed  beyond  row  and  column  permutations  and  gray  represents  entries  that  have  been  updated  but  are  not  (necessarily)  zero. 
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Fig.  2.  An  8  x  8  block  matrix  is  compressed  through  a  three-level  compression  scheme  in  the  vein  of  Observation  9.  The  gray  scale 
coding  is  the  same  as  in  Fig.  1 . 


4.  An  algorithm  for  the  computation  of  a  compressed  inverse 

In  Section  3  we  demonstrate  the  existence  of  a  compact  factorization  of  the  inverse  of  any  block  matrix 
whose  neutered  rows  and  columns  of  blocks  are  rank-deficient.  In  this  section,  we  describe  a  numerical 
scheme  for  the  construction  of  such  factorizations,  and  estimate  its  efficiency. 

Remark  12.  The  inversion  scheme  presented  in  this  section  is  fairly  generic,  depending  only  on  the  ranks  of 
off-diagonal  blocks  of  the  matrix  to  be  inverted.  In  situations  where  the  structure  of  the  matrix  is  known, 
further  improvements  are  possible.  For  instance,  when  applied  to  a  dense  nxn  matrix  resulting  from  the 
discretization  of  a  contour  integral  operator,  the  generic  algorithm  of  this  section  requires  0(n2)  arithmetic 
operations  to  construct  its  inverse,  while  the  customized  technique  presented  in  Section  5  requires 
O (n  log2  n)  operations  or  less,  depending  on  the  integral  operator. 

4. 1.  Single  block  compression 

Lemmas  2  and  3  assert  that  the  inverse  of  a  2  x  2  block  matrix  of  the  form  (3.2)  can  be  factored  in  the 
compressed  form  (3.7).  The  quantities  A(U),  R,  L,  ^'s2)  and  that  appear  in  (3.7)  can  be  determined  by 
taking  the  following  steps: 


(1)  Determine  a  matrix  L  €  C”x"  and  a  permutation  JR  €  J*  such  that 


,L4<12)  = 


1 


where  A(^  is  formed  by  the  k  rows  of  A(12>  specified  by  Jr,  as  described  in  Observation  6. 
(2)  Determine  a  matrix  R  €  C"x"  and  a  permutation  Jc  €  J*  such  that 

A{2l)R  =  [/f^  o], 

where  Aq $  is  formed  by  the  columns  of  A(21)  specified  by  Jc,  as  described  in  Observation  5. 
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(3)  Partition  R  and  L  as  specified  in  (3.5)  and  form  the  blocks  Xy  as  in  (3.6). 

(4)  Compute  .<4(n),  B,  C  and  D  using  the  formulas  (3.9)  and  (3.12). 

Steps  (1)  and  (2)  require  O (mnk)  floating  point  operations  while  steps  (3)  and  (4)  require  0(«3)  opera¬ 
tions.  The  total  cost  is  thus  O (mnk  +  ti3). 

4.2.  Single-level  compression 

Let  A  denote  a  matrix  consisting  of  p  xp  blocks,  each  of  size  n  x  n,  in  which  every  neutered  row  or  col¬ 
umn  has  rank  k  such  that  k<n.  Observation  8  states  that  such  a  matrix  can  be  factored  in  the  sparse  form 
(3.15).  This  factorization  contains  the  entities  2?„  C„  Dh  for  ij  =  1,  ...,p,  which  can  be  computed 
through  p  applications  of  the  single-block  compression  technique  of  Section  4.1  -  one  application  for  each 
diagonal  block.  Each  one  of  the  p  steps  requires  O (pkn2  +  n3)  floating  point  operations  resulting  in  a  total 
computational  cost  of  O (p2kn2  +  pn3). 

Remark  13.  The  off-diagonal  blocks  of  the  compressed  matrix  A  are  never  explicitly  computed.  Instead, 
the  blocks!  €  C*x*  is  specified  by  giving  the  index  vectors  7^,7^  €  J*  that  define  the  rows  and 
columns  of  A(iJ)  6  C*x*,  whose  intersections  form  Ai(i).  (Here  is  the  index  vector  obtained  when 
compressing  the  ith  row  of  blocks  and  J^)  is  the  index  vector  obtained  when  compressing  the  /th 
column  of  blocks.) 


4.3.  Multi-level  compression 


The  single-level  technique  compresses  a  block  matrix  A  to  form  another  block  matrix  A  with  smaller 
blocks.  Now,  if  by  clustering  blocks,  we  can  create  rank-deficiencies  in  the  neutered  rows  and  columns 
of  A,  then  the  single-level  technique  can  be  applied  recursively.  The  algorithmic  implementation  entirely 
follows  the  description  in  Observation  9. 

When  estimating  the  computational  cost  for  the  multi-level  technique  we  use  r  =  1, . . .,  R  as  an  index  for 
the  levels  (with  r  =  1  being  the  finest  level),  we  let  pr  denote  the  number  of  blocks  on  level  r,  nr  the  average 
block  size  and  kr  the  average  rank.  The  cost  for  step  r  is  then 

tr~krp2rn2+prn3r.  (4.1) 

We  assume  that  pjcr  >  nr  so  that  the  second  term  is  dominated  by  the  first.  Using  that  p,kr  =  pr+ i«r+ 1, 
we  then  find  that  the  total  cost  for  all  R  steps  is 

R  R 

T~Z‘r~  (4-2) 

r=\  r—1 


At  each  level,  the  number  of  blocks  is  cut  in  half,  so 

n  - 

P r  2r“  1 

We  let  yr  =  nr+  \llnr  denote  the  compression  ratio  so  that 
«r  =  ( 2yr_,)  -  -  -  (2y,)n,. 

Assuming  that  there  exists  a  constant  y  such  that  yr  ^  y,  we  obtain  the  bound 
<  (2y)r-1«i- 


(4.3) 

(4.4) 


(4.5) 
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Combining  (4.2),  (4.3)  and  (4.5),  we  find  that  the  total  cost  is 


T  ~  Y)  r  (2y)'n'  ~  -Pi"!  £  (2y3)r- 

r~\  Z  1  r=l 


(4.6) 


We  assume  that  y  <  v^4  =  0.7937  -so  that  the  sum  is  boimded  by  (1  -  2y3)  '.  Letting  N denote  the  size  of 
the  matrix  we  find  that  N  -  p\n\  and  thus 


T~N2rn. 


(4.7) 


The  assumption  that  (4.5)  holds  for  some  y  <  0.7939-  •  •  is  valid  in  many  environments  relating  to  discret¬ 
ization  of  contour  integral  equations.  We  will  return  to  this  point  in  Section  6. 


4.4.  Conditioning 

All  factorizations  computed  in  this  section  are  variations  of  (3.15).  For  this  formula  to  be  of  practical 
use,  the  matrices  Bh  C,-  and  Z>,  must  not  be  excessively  large  (in  say  the  / 2  operator  norm)  and  the  condition 
number  of  A  has  to  be  similar  to  that  of  A.  The  formulas  (3.12)  imply  that  this  is  true  if  \\X22  ||2  is  °f  mod¬ 
erate  size  (since  (2.19)  and  (2.21)  assert  that  R  and  L  are  well-conditioned).  Under  the  assumptions  of  this 
section  (that  the  global  matrix  be  non-singular  and  the  off-diagonal  blocks  have  low  rank)  it  is  not  possible 
to  prove  any  such  bound. 

However,  in  the  context  of  contour  integral  equations,  the  problem  can  largely  be  avoided  by  enforcing 
that  the  compression  be  symmetric  in  the  sense  of  Remark  7.  The  reason  is  that  the  diagonal  blocks  of  the 
original  matrix  tend  to  have  the  form 

A{n)=D  +  E,  (4.8) 

where  D  is  a  positive  definite  Hermitian  matrix  and  E  is  “small”  compared  to  D  in  operator  norm.  Since 
R2  =  L\  when  symmetry  is  enforced,  we  find  that,  cf.  (3.6), 

Xn  =  L2{D  +  E)L\  =  (i2Z>,/2)  (L2D''2)*  +  L2EV2.  (4.9) 

Here,  the  first  term  is  well-conditioned,  and  the  second  has  at  most  a  few  non-small  singular  values.  Thus,  it 
is  very  unlikely  that  the  sum  of  the  two  matrices  should  have  any  small  singular  values.  Furthermore, 
should  such  a  coincidence  happen,  the  algorithm  detects  it  and  avoids  the  problem  by  locally  re-partition¬ 
ing  the  matrix. 


4.5 .  Error  estimation 

Given  a  prescribed  accuracy  e,  the  numerical  scheme  presented  in  this  section  solves  the  equation 
Au=f  (4.10) 

by  constructing  an  approximation  A£  that  satisfies 

IM-^||2<c  (4.11) 

and  is  such  that  the  approximate  solution  uc  =  A~'f  can  be  computed  fast.  The  error  in  u  satisfies 

u-uc  =  (A~'  -A;t)/  =  A;'(Ae-A)A-'/  =  A;l(Ae-A)u.  (4.12) 

The  relative  error  is  therefore  bounded  as  follows: 

i!^^<iM:'(^-^)iia<E|Mf:'il2- 


(4.13) 


r 

it. 
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While  the  algorithm  cannot  possibly  control  \\AC  1 1|2,  this  quantity  can  be  computed  cheaply  using  power 
iteration,  see  Remark  10.  Thus,  an  assured  bound  for  the  relative  error  can  be  computed  a  posteriori. 


5.  An  accelerated  algorithm  applicable  to  contour  integral  equations 

The  bulk  of  the  computational  cost  of  the  matrix  compression  technique  presented  in  Section  4  consists 
of  the  cost  of  determining  index  vectors  and  transformation  matrices  that  compress  the  neutered  rows  and 
columns.  When  the  matrix  under  consideration  is  a  discrete  approximation  of  a  contour  integral  operator, 
it  is  possible  to  determine  these  quantities  through  an  entirely  local  operation  whose  cost  only  depends  on 
the  size  of  the  diagonal  block  to  be  compressed  (i.e.,  not  on  the  size  of  the  rest  of  the  matrix).  This  is  pos¬ 
sible  since  the  column  and  row  operations  employed  in  the  present  matrix  compression  technique  do  not 
update  the  elements  of  the  off-diagonal  blocks,  as  discussed  in  Remark  13. 

This  section  is  structured  as  follows:  In  Section  5.1  we  describe  a  single-block  compression  technique  for 
the  boundary  integral  equations  associated  with  Laplace’s  equation  in  two  dimensions  that  is  faster  than 
the  generic  single-block  technique  of  Section  4.1.  In  Section  5.2  we  describe  single  and  multi-level  tech¬ 
niques  for  contour  integral  equations  obtained  by  repeated  application  of  the  single-block  compression 
technique  of  Section  5.1.  Section  5.3  discusses  generalizations  of  the  technique  to  other  equations  of  poten¬ 
tial  theory. 

Remark  14.  (Numerically  rank-deficient  matrices)  In  this  section,  we  say  that  a  matrix  has  rank  k 
provided  that  it  has  only  k  singular  values  that  are  larger  than  some  preset  accuracy.  In  other  words,  we  do 
not  distinguish  between  what  is  sometimes  called  “numerical  rank”  and  actual  rank. 

5.1.  Single-block  compression 

The  following  observation  summarizes  the  principle  finding  of  this  section: 

Observation  15.  Let  the  matrix  A  in  (3.2)  represent  the  discretization  of  the  integral  operator 

J  K(x,y)u(y)ds(y),  for  x  €  T,  (5.1) 

where  r  =  +  T 2  is  a  contour  (Fig.  3  shows  one  example),  the  block  structure  of  A  corresponds  to  the 

partitioning  of  f  (so  that,  e.g.,  A(12)  represents  evaluation  on  Ti  of  the  potential  generated  by  a  charge  dis¬ 
tribution  on  fa),  and  K  is  the  kernel  of  a  single  and/or  double  layer  potential  for  the  Laplace  operator.  Then 
under  very  mild  assumptions  on  the  contour  F,  the  factorization  (3.3)  can  be  computed  using  0(n3)  floating 
point  operations,  where  n  is  the  number  of  points  used  in  the  discretization  of  f\. 


Fig.  3.  A  contour  F.  In  figure  (a),  the  partitioning  r  =  Fj  +  r2  is  shown  with  F]  drawn  with  a  bold  line.  In  figure  (b)  the  contour  f2  is 
drawn  with  a  thin  solid  line  and  rcxt  with  a  dashed  line. 
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The  idea  behind  the  construction  alluded  to  in  Observation  15  is  simple:  Instead  of  compressing  the 
interaction  between  T\  and  F 2,  it  is  sufficient  to  compress  the  interaction  between  F\  and  a  small  contour 
f2,  formed  by  the  union  of  an  artificial  circular  contour  enclosing  F \  and  the  part  of  F2  that  is  inside  this 
circle  (as  shown  in  Fig.  3(b)).  The  reason  is  that  by  virtue  of  Green’s  theorem,  any  potential  field  generated 
by  charges  on  r 2  can  equally  well  be  generated  by  charges  on  r2.  Finally  we  note  that  if  T\  is  discretized 
using  n  nodes,  then  typically  r2  can  be  discretized  using  0(n)  nodes,  yielding  a  total  cost  for  the  procedure 
of  0(n3). 

The  remainder  of  this  subsection  is  devoted  to  substantiating  Observation  15.  We  start  by  introducing 
some  notation;  let  rcirc  denote  the  circle  in  Fig.  3(b)  and  let  rex t  denote  the  part  of  f2  outside  of  rcirc.  Fur¬ 
thermore,  let  Sr2-*r,  denote  the  integral  operator  that  evaluates  a  potential  on  F 1  caused  by  a  charge  dis¬ 
tribution  on  r2.  In  other  words,  Sr2~r,  acts  on  a  charge  distribution  u  as  follows: 

[Sr2^r,w]W  -  J  K{x,y)u(y)ds(y),  for  x  €  Ft.  (5.2) 

Observation  15  rests  on  the  following  claim: 

Lemma  4.  Let  H  €  Cn*"'  denote  the  matrix  discretizing  ,  and  let  the  index  vector  Jr  6  J*  and  the 

transformation  matrix  L  be  such  that  they  compress  H  in  the  sense  of  Observation  6.  Then  Jr  and  L  also 
compress  the  matrix  B  €  Cnxm  that  approximates  the  operator  Sral->r ,  ■ 

Sketch  of  proof.  It  is  sufficient  to  prove  that  there  exists  a  matrix  W  €  C"'xm  with  moderate  lz  operator 
norm  such  that 


B  =  HW.  (5.3) 

(The  matrix  W is  the  matrix  that  maps  a  charge  distribution  on  Fext  to  an  equivalent  charge  distribution  on 
Fcirc.)  Now,  Eq.  (5.3)  is  the  discrete  approximation  of  the  operator  relation 

sr„,-r,  =  Sr^-.r,  [(Sr^-r^J  lS/-CM_rcirc]-  (5-4) 

The  matrix  IF  in  (5.3)  corresponds  to  the  operator  in  square  brackets  in  (5.4).  That  this  operator  is  bounded 
is  a  consequence  of  Green’s  theorem.  □ 


5.2.  Single-  and  multi-level  compression 


The  generic  single-  and  multi-level  compression  techniques  of  Sections  4.2  and  4.3  were  obtained  by  re¬ 
peated  application  of  the  single-block  technique  described  in  Section  4.1.  Single-  and  multi-level  techniques 
for  contour  integral  equations  are  analogously  obtained  by  repeated  application  of  the  single-block  tech¬ 
nique  of  Section  5.1. 

It  remains  to  estimate  the  computational  cost  of  the  accelerated  compression  technique.  The  cost  for  a 
single  level  compression  at  level  r  =  1, . . .,  R  is  now,  cf.  (4.1), 

tr~prn\ ,  (5.5) 

where  pr  denotes  the  number  of  clusters  on  level  r  and  nr  is  the  (average)  cluster  size.  Under  the  assumptions 
(4.3)  and  (4.5),  we  find  that 


tr 


3r— 3 


n 


3 

p 


The  total  cost  for  all  R  steps  is  then 


(5.6) 
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r=rl  r^\ 


(5.7) 


We  assume  that  y  <  4~ 1/3  =  0.630-  •  •  so  that  the  sum  is  bounded  by  (1  -  4y3)  '.  Letting  N  denote  the  size  of 
the  original  matrix,  we  find  that  N  =  and  thus 


T  ~  Nn\. 


(5.8) 


When  the  kernel  of  the  equation  is  associated  with  the  fundamental  solution  of  Laplace’s  equation,  it  is 
possible  to  prove  that  the  assumption  (4.5)  holds  with  y  ~  1/2  when  «i  ^  log  N,  which  gives  an  upper 
bound  on  the  computational  cost  of  0(iVlog2  N).  However,  further  acceleration  is  achieved  by  choosing 
a  smaller  even  though  the  cluster  size  then  grows  slightly  in  the  first  couple  of  compressions.  This  ex¬ 
plains  why  the  log2  N  factor  is  not  visible  in  the  experiments  in  Section  6. 


Remark  16.  The  single-block  compression  technique  described  in  Observation  15  requires  the  algorithm  to 
determine  which  of  the  nodes  of  r2  lie  inside  the  artificial  circle  rdrc.  If  this  search  would  be  done  by  brute 
force,  the  computational  cost  for  a  single  level  solve  would  include  a  term  p2n2,  cf.  (5.5).  Even  though  the 
constant  in  front  of  this  term  is  small,  it  would  dominate  the  computation  for  large  problems  (in  our 
implementation,  this  would  happen  for  N  ^  25000).  One  solution  to  this  problem  is  to  perform  the  search 
via  a  hierarchical  search  tree;  the  estimate  (5.5)  then  remains  valid. 


5.3.  Generalizations 

The  technique  presented  in  Section  5.1  for  Laplace’s  equation  is  readily  applicable  to  other  equations  of 
classical  potential  theory;  Helmholtz,  Yukawa,  Shrodinger,  Maxwell,  Stokes,  elasticity,  etc.  The  only  com¬ 
plication  occurs  when  working  with  equations  that  may  have  resonances.  In  such  cases,  it  is  possible  that 
the  operator  of  self-interaction  for  the  artificial  circle  (the  operator  •Srcjrc-*rdre  in  (5.4))  has  a  non-trivial  null- 
space.  This  complication  can  be  rectified  by  letting  the  artificial  charges  on  /  drc  consist  of  both  monopoles 
and  dipoles.  Alternatively,  it  is  possible  to  consider  only  one  type  of  charges  but  placing  them  on  two  con¬ 
centric  circles  instead  of  a  single  one. 

When  applied  to  oscillatory  problems  such  as  Helmholtz’  and  Maxwell’s  equations,  the  efficiency  of  the 
technique  deteriorates  when  the  wave  number  increases  since  then  the  compression  rate  deteriorates  as  the 
blocks  grow  larger  (in  other  words,  the  assumption  (4.5)  no  longer  holds).  In  practice,  it  appears  that  the 
method  experiences  very  few  problems  for  objects  smaller  than  about  50  wavelengths.  After  that,  the  com¬ 
putational  complexity  increases  superlinearly  with  the  problem  size  although  the  technique  remains  viable 
for  equations  set  on  contours  a  few  hundred  wavelengths  in  size.  This  effect  will  be  illustrated  in  the  numer¬ 
ical  examples  in  Section  6.1. 

Finally  we  remark  that  the  scheme  has  0(n  log*  n)  complexity  when  applied  to  integral  equations  defined 
on  one-dimensional  curves  in  any  dimension.  The  fact  that  we  have  so  far  only  discussed  equations  embed¬ 
ded  in  two  space  dimensions  is  simply  that  contour  integral  equations  associated  with  boundary  value  prob¬ 
lems  in  two  dimensions  is  the  most  common  source  of  such  equations. 


6.  Numerical  examples 

In  this  section  we  present  the  results  of  a  number  of  numerical  experiments  performed  to  assess  the  effi¬ 
ciency  of  the  numerical  scheme  presented  in  Sections  4  and  5.  In  every  experiment,  we  compute  a  com¬ 
pressed  factorization  of  the  inverse  of  the  matrix  resulting  from  Nystrom  discretization  of  one  of  the 
following  three  integral  equations: 
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±l2U{x)+hSr  [nM'V^loglx_:,;OuO;)dyO)=/W>  *er> 


(6.1) 


l 


[log  |  x-y  |]H(y)ds(y)  =  /(*), 


x  g  r, 


(6.2) 


T2iu(x)  + J  [(n(y)-Vy  +  ik)H0(k\x-y\)]u(y)ds{y)=f{x),  xeT,  (6.3) 

where  n(y)  is  the  outward  pointing  unit  normal  of  T  at  y  and  H0(x)  =  J0(x)  +  iTo(*)  is  the  Hankel  function 
of  zeroth  order.  Eqs.  (6.1)  and  (6.2)  are  the  double  and  single  layer  equations  associated  with  Laplace 
Dirichlet  problems,  and  (6.3)  is  an  equation  associated  with  the  Helmholtz  Dirichlet  problem  with  wave 
number  k.  In  Eqs.  (6.1)  and  (6.3),  the  top  sign  in  front  of  the  first  term  refers  to  exterior  problems  and 
the  lower  sign  refers  to  interior  problems. 

The  kernel  in  (6.1)  is  smooth  and  the  equation  was  discretized  using  the  trapezoidal  rule  (which  is  expo¬ 
nentially  convergent  on  a  smooth  contour).  The  Eqs.  (6.2)  and  (6.3)  involve  log-singular  kernels  that  were 
discretized  using  the  modified  trapezoidal  quadrature  rules  of  [10]  of  orders  6  and  10,  respectively.  The 
algorithm  was  implemented  in  Fortran  77  and  the  experiments  were  run  on  a  2.8  GHz  Pentium  4  desktop 
with  512Mb  of  RAM  memory. 

When  presenting  the  numerical  results,  we  use  the  following  notation: 

R  the  number  of  levels  in  the  multi-level  solver, 

Nstan  the  size  of  the  discrete  problem  at  the  start, 

JVfinai  the  size  of  the  compressed  problem, 
rtot  the  total  CPU  time  (in  seconds), 

fsoive  the  CPU  time  required  to  apply  the  factorized  inverse  (in  seconds), 
ctop  the  condition  number  of  the  compressed  matrix, 

<rmin  the  smallest  singular  value  of  the  original  matrix, 

M  the  amount  of  memory  used  (in  MB), 

Actual  the  relative  error  in  u,  Eac ,uai  =  Ik  -  «||/||«||, 

En s  the  relative  residual  error,  Ens  =  || Aut  -_/f|/|l/t|, 


In  each  experiment,  the  right  hand  side  /  was  the  Dirichlet  data  corresponding  to  a  potential  field  gen¬ 
erated  by  a  few  randomly  placed  point  charges.  Since  the  exact  potential  field  was  known,  we  could  com¬ 
pare  the  potential  field  generated  by  the  numerical  solution  to  the  exact  one.  We  did  this  at  /  random  points 
on  a  circle  enclosing  T  and  separated  from  T  by  one  quarter  of  its  radius.  Letting  {uw}j=i  denote  the  exact 
potential  and  denote  the  potential  generated  by  ue,  we  define  the  relative  /2-norm  error  in  the 

potential  as  Epot  =  ||t>  -  »J|/||t>||. 


6.1.  Example:  a  smooth  contour 

In  this  subsection  we  present  results  pertaining  to  the  smooth  contour  shown  in  Fig.  4.  The  contour  was 
discretized  using  between  800  and  102  400  points  and  the  integral  equations  associated  with  exterior  Dirich¬ 
let  problems  were  solved.  Tables  1-3  present  the  results  for  the  kernels  (6.1)-(6.3),  respectively.  As  a  refer¬ 
ence,  we  give  in  Table  4  the  timings  for  highly  optimized  implementations  of  the  LU-factorization,  direct 
matrix-vector  multiplication  and  FMM-accelerated  matrix-vector  multiplication. 

For  the  two  Laplace  problems  considered,  we  see  that  both  the  computational  cost  and  the  memory 
requirement  scale  more  or  less  linearly  with  the  problem  size,  as  expected.  We  recall  that  this  expectation 
was  based  on  the  postulate  that  for  Laplace  problems,  the  interaction  rank  between  adjacent  clusters  de- 
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Fig.  4.  A  smooth  contour.  The  length  of  the  contour  is  roughly  5.1  and  its  horizontal  width  is  2. 


Table  1 

Computational  results  for  the  double  layer  potential  (6.1)  associated  with  an  exterior  Laplace  Dirichlet  problem  on  the  contour  shown 


in  Fig.  4 


A^siart 

Affinal 

ftot 

^solvc 

•factual 

fpot 

£tOp 

^min 

M 

400 

301 

5.3e-01 

2.9e— 03 

2.3e-10 

4.7e-10 

3.0e— 06 

4.3e+00 

1.3e-02 

4.2e+00 

800 

351 

9.6c-01 

4.1e— 03 

2.5e-10 

2.2e— 10 

6.3e-10 

9.1e+00 

1.2e-02 

6.5e+00 

1600 

391 

1.6e+00 

6.3e-03 

1.4e-10 

1.3e— 10 

1.6e-10 

1.6e+01 

1.2e-02 

9.2e+00 

3200 

391 

1.8e+00 

8.5e-03 

- 

6.6e— 11 

3.7e-10 

3.2e+01 

1.2e-02 

l.le+01 

6400 

391 

2.2e+00 

1.2e-02 

- 

5.9e —  1 1 

8.9e-ll 

7.7e+01 

1.2e~02 

1.4e+01 

12  800 

390 

2.6e+00 

1.9e— 02 

_ 

3.6e— 11 

5.9e-l  1 

1.4e+02 

1.2e-02 

2.1e+01 

25  600 

391 

3.9e+00 

3.4e— 02 

- 

2.7e-ll 

4.7e— 10 

2.1e+02 

- 

3.5e+01 

51200 

393 

6.5c+00 

6.5e— 02 

~ 

2.5e —  1 1 

5.3e — 1 1 

2.0e+02 

- 

6.3e+01 

102  400 

402 

1.3e+01 

1.2e-01 

- 

2.0e— 11 

- 

l.le+03 

- 

1.2e+02 

Table  2 


Computational  results  for  the  single  layer  potential  (6.2)  associated  with  an  exterior  Laplace  Dirichlet  problem  on  the  contour  shown 
in  Fig.  4  _ _____ _ 


Afgtart 

Affinal 

ttoi 

^solvc 

factual 

fres 

fpot 

ctop 

^min 

M 

400 

253 

4.1e-01 

1.9e-03 

4.6e-09 

2.7e-09 

1.6e— 04 

2.2e+01 

3.5e-02 

3.1e+00 

800 

306 

8.2e-01 

3.3e— 03 

7.5e-09 

9.9e— 09 

2.4e-06 

1.6e+02 

2.9e-04 

5.4e+00 

1600 

353 

1.6e+00 

6.2e-03 

4.9e-09 

6.3e-09 

1 .6e— 09 

1.5e+02 

1.4e-04 

8.6e+00 

3200 

369 

2.3e+00 

9.7e-03 

- 

2.5e— 07 

1.2e— 10 

2.1e+02 

4.2e-05 

1.2e+01 

6400 

379 

3.2e+00 

1 .6e— 02 

- 

1.3e-08 

6.8e— 12 

2.6e+02 

2.1e-05 

1.8e+01 

12  800 

395 

4.8e+00 

2.7e— 02 

- 

1.7e-08 

3.4e— 12 

2.8e+02 

2.7e-0  6 

2.9e+01 

25  600 

409 

7.7e+00 

4.8e— 02 

- 

3.6e-08 

1 .4e—  1 1 

3.5e+02 

2.7e~07 

5.0e+01 

51200 

419 

1.4e+01 

9.0e-02 

~ 

2.7e-07 

- 

3.7e+02 

3.5e-07 

9.1e+01 

429 

3.6e+01 

1.7e— 01 

- 

1.6e-08 

- 

5.2e+02 

- 

1.7e+02 

Table  3 

Computational  results  for  the  kernel  (6.3)  associated  with  an  exterior  Helmholtz  Dirichlet  problem  on  the  contour  shown  in  Fig.  4 


k 

A^start 

Affinal 

ho\ 

fgolve 

factual 

fres 

fpot 

Ctop 

O'min 

M 

21 

435 

l,5e+01 

3.3e-02 

2.7e-07 

9.7e— 08 

7.1e-07 

4.1e+03 

6.5e-01 

1.3e+01 

40 

1600 

550 

3.0e+01 

6.7e-02 

1.6e-07 

6.2e-08 

4.0e-08 

6.1e+03 

8.0e-01 

2.5e+01 

79 

3200 

683 

5.3e+01 

ft> 

1 

O 

- 

5.3e— 08 

3.8e-08 

2.1e+04 

3.4e-01 

4.5e+01 

158 

6400 

870 

9.2e+01 

2.0e-01 

- 

3.9e— 08 

2.9e— 08 

4.0e+04 

3.4e— 01 

8.2e+01 

316 

12  800 

1179 

1.8e+02 

3.9e-01 

- 

2.3e— 08 

2.0e-08 

4.2e+04 

3.4e— 01 

1.6e+02 

632 

25  600 

1753 

4.3e+02 

7.5e+00 

- 

1.7e— 08 

1.4e— 08 

9.0e+04 

3.3e-01 

3.5e+02 

1264 

2864 

(1.5e+03) 

(2.3e+02) 

- 

9.5e— 09 

- 

- 

- 

8.3e+02 

The  Helmholtz  parameter  was  chosen  to  keep  the  number  of  discretization  points  per  wavelength  constant  at  roughly  45  points  per 
wavelength  (resulting  in  a  quadrature  error  about  10“ 12).  The  times  in  parenthesis  refer  to  experiments  that  did  not  fit  in  RAM. 
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Table  4 

Timings  (in  seconds)  for  highly  optimized  implementations  of  the  LU-factorization,  matrix-vector  multiplication  and  FMM 
accelerated  matrix-vector  multiplication 


N 

400 

800 

1600 

3200 

6400 

12  800 

25  600 

51200 

102  400 

iLV 

^mu!t 

*FMM 

2.8e-02 
7.5e— 04 
3.8e— 03 

2.0c-01 

2.9c-03 

8.0e-03 

1.6e+00 
1.2e— 02 
1.6e~02 

1.3e+01 

4.8e-02 

3.0e-02 

(I.0e+02) 
(1.9e-02) 
6.0e— 02 

(8.3e+02) 

(7.7e-01) 

1.2e-01 

(6.7e+03) 
(3.1e+00) 
2.4e— 01 

(5.3e+04) 
(1.2e+01) 
4.8e— 01 

(4.3e+05) 

(4.9e+01) 

9.6e-01 

The  FMM  was  run  at  a  relative  accuracy  of  10~10  with  the  same  kernel  as  in  the  Eq.  (6.2).  The  numbers  in  parenthesis  are  extrapolated. 


pend  only  very  weakly  (logarithmically)  on  their  size.  Fig.  5  illustrates  this  point;  it  shows  that  after  two 
rounds  of  compression,  almost  the  only  nodes  that  have  survived  are  the  ones  near  the  border  to  the  neigh¬ 
boring  clusters.  The  figure  also  illustrates  that  the  algorithm  detects  the  need  to  keep  more  nodes  in  the 
interior  of  those  clusters  that  run  close  to  other  clusters.  (For  an  example  of  a  situation  where  the  Eq. 
(6.1)  needs  to  be  discretized  using  a  large  number  of  nodes  in  spite  of  the  fact  that  the  contour  is  uncom¬ 
plicated,  see  [11].) 

Since  the  scheme  presented  in  this  paper  relics  on  rank-considerations  only,  it  works  for  oscillatory  prob¬ 
lems  with  low  wave  numbers  but  it  eventually  fails  as  the  wavenumber  is  increased.  Table  5  illustrates  this 
point  by  showing  how  the  compression  ratios  deteriorate  as  the  wavenumber  k  in  Eq.  (6.3)  is  increased  from 
1  to  1200.  However,  the  authors  were  surprised  to  find  that  the  method  remains  viable  up  to  objects  about 
200  wavelengths  across,  as  indicated  in  Table  3. 

Remark  17.  (Comparison  with  the  fast  multipole  method)  From  Tables  1  and  4,  we  see  that  a  single  FMM 
matrix-vector  multiply  is  about  15-20  times  faster  than  a  matrix  inversion.  Thus,  if  an  iterative  solver 
requires  less  than  15-20  iterations  to  solve  Eq.  (6.1),  this  would  beat  the  direct  method  for  a  single  solve. 
However,  once  the  inverse  has  been  computed,  it  can  be  applied  to  additional  right  hand  sides  in  about  one 
tenth  of  the  time  required  for  a  single  FMM  accelerated  matrix-vector  multiply. 


Fig.  5.  The  points  left  after  two  rounds  of  compression  of  the  contour  shown  in  Fig.  4.  The  crosses  mark  the  boundary  points  between 
adjacent  dusters. 


Table  5 

Deterioration  of  compression  rates  for  large  wavenumber  Helmholtz  problems 


k 

yi 

72 

73 

V4 

75 

76 

77 

7s 

Affinal 

M 

1 

0.68 

0.58 

0.54 

0.55 

0.58 

0.64 

0.64 

0.72 

512 

167 

100 

0.72 

0.56 

0.55 

0.56 

0.60 

0.68 

0.72 

0.82 

111 

197 

500 

0.72 

0.58 

0.58 

0.62 

0.68 

0.76 

0.84 

0.91 

1522 

303 

The  table  shows  the  compression  ratio  yjt  see  (4.4),  at  each  of  the  levels  j  =  1, . . .,  8  for  the  Helmholtz  kernel  (6.3)  on  the  smooth  contour 
in  Fig.  4,  discretized  with  N  -  25  600  points.  The  three  rows  correspond  to  wave  numbers  k  =  1,  100,  500.  The  second  to  last  column 
shows  the  number  of  degrees  of  freedom  left  on  the  finest  level  and  the  last  column  shows  the  total  memory  requirement  (in  MB). 


I 

i 
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6.2.  A  rippled  contour  that  almost  self-intersects 

In  this  subsection  we  present  results  pertaining  to  the  rippled  contour  shown  in  Fig.  6.  The  contour  was 
discretized  using  between  800  and  102  400  points  and  integral  equations  associated  with  exterior  Dirichlet 
problems  were  solved.  The  number  of  ripples  in  the  experiments  increase  with  the  number  of  discretization 
nodes  in  such  a  fashion  that  there  are  roughly  80  nodes  for  each  wavelength.  Tables  6-8  present  the  results 
for  the  kernels  (6.1)-(6.3),  respectively. 


Fig.  6.  (a)  A  rippled  contour,  (b)  A  close-up  of  the  area  marked  by  a  dashed  rectangle  in  (a).  The  horizontal  axis  of  the  contour  has 
length  1  and  the  number  of  ripples  change  between  the  different  experiments  to  keep  a  constant  ratio  of  80  discretization  nodes  per 
wavelength. 


Table  6 

Computational  results  for  the  double  layer  potential  (6.1)  associated  with  an  exterior  Laplace  Dirichlet  problem  on  the  rippled  contour 


shown  in  Fig.  6 


■Nslart 

Afinal 

ftot 

^ sol  VC 

^actual 

Ercs 

Epot 

ctop 

°min 

M 

400 

171 

2.3e-01 

1.0e-03 

1.5e-10 

Lie— 10 

1.3e— 07 

7.4e+00 

Lie— 01 

1.5e+00 

800 

228 

3.5e— 01 

1.0e-02 

1.9e— 10 

1.3e-10 

3.8e— 08 

9.7e+00 

7.6e— 02 

3.0e+00 

1600 

306 

7.3e-01 

5.8e— 03 

1.3e-10 

1.6e-10 

5.5e— 08 

1.6e+01 

5.2e— 02 

6.2e+00 

3200 

386 

2.2e+00 

8.5e— 03 

- 

1 .4e— 10 

7.5e— 08 

3.1e+01 

3.9e— 02 

1.2e+01 

6400 

460 

4.4e+00 

1.7e-02 

- 

7.2e —  1 1 

8.2e— 08 

7.0e+01 

3.3e— 02 

2.1e+01 

12  800 

536 

9.6e+00 

3.5e-02 

- 

5.9e— 1 1 

3.7e— 08 

1.4e+02 

2.9e— 02 

4.0e+01 

25  600 

597 

2.0c+01 

7.6e-02 

- 

2.0e-ll 

1.4e-09 

2.2e+02 

- 

7.6e+01 

51200 

641 

4.0e+01 

1.5e— 01 

- 

2.9e-ll 

- 

6.2e+02 

- 

1.5e+02 

102  400 

688 

(1.8e+01) 

3.9e-0I 

- 

1.2e— 11 

- 

7.8e+02 

- 

2.9e+02 

Table  7 

Computational  results  for  the  single  layer  potential  (6.2)  associated  with  an  exterior  Laplace  Dirichlet  problem  on  the  rippled  contour 


shown  in  Fig.  6 


A^start 

Affinal 

t\Ol 

^solvc 

Etcs 

^pot 

ctop 

^min 

M 

400 

176 

2.4e-01 

9.2e— 04 

2.1e— 09 

1.7e-09 

2.4e— 05 

1.6e+02 

5.5e-04 

1 .6e+00 

800 

220 

3.9e— 01 

3.8e— 03 

1.6e-08 

3.0e— 08 

8.0e-06 

l.le+03 

1.0e-05 

3.1e+00 

1600 

256 

6.9e-01 

5.3e-03 

5.2e— 09 

7.0e-09 

9.8e-08 

2.8e+02 

1.6e-05 

5.3e+00 

3200 

286 

1.3e+00 

7.6e-03 

- 

7.0e— 09 

1.6e— 08 

3.3e+02 

1.2e-05 

9.1e+00 

6400 

314 

2.5e+00 

1.4e— 02 

- 

1.5e-07 

2.3e— 09 

7.5e+02 

2.1e-06 

1.6e+01 

12  800 

342 

4.6e+00 

2.8e-02 

- 

2.4e— 08 

1.5e— 09 

4.7e+02 

1.7e— 07 

2.9e+01 

25  600 

362 

8.8e+00 

6.2e— 02 

- 

2.3e— 08 

2.2e-09 

l.le+03 

9.7e-08 

5.5e+01 

51  200 

374 

L7e+01 

1.2e— 01 

- 

2.1e— 08 

- 

1.8e+03 

3.1e-08 

l.le+02 

102  400 

386 

(8.1e+0) 

2.3e— 01 

- 

1.5e-07 

- 

3.1e+03 

- 

2.1e+02 

m 

p: 
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Table  8 

Computational 
Fig.  6 


results  for  the  kernel  (6.3)  associated  with  an  exterior  Helmholtz  Dirichlet  problem  on  the  rippled  contour  shown  in 


N final  f  tot 


f  solve 


^actual 


-Ercs 

£Pot 

^top 

^min 

M 

6.9e-08 

9.4e-07 

1.2e+04 

7.9e-01 

3.2e+00 

7.4e-08 

1.2e-07 

3.9e+03 

7.9e-01 

8.2e+00 

6.7e-08 

8.1e-08 

7.4e+03 

7.8e-01 

2.0e+01 

5.2e-08 

6.4e-08 

1.2e+04 

8.0e-01 

5.0e+01 

4.8e-08 

7.5e-08 

1.4e+04 

8.0e-01 

1.3e+02 

5.5e-08 

7.5e~08 

8.8e+04 

8.0e-01 

3.4e+02 

_ 

_ 

- 

- 

9.8e+02 

7 

400 

224 

2.9c+00 

15 

800 

320 

7.7e+00 

29 

1600 

470 

2.1e+01 

58 

3200 

704 

6.1e+01 

115 

6400 

1122 

1.4e+02 

230 

12  800 

1900 

(4.7e+02) 

461 

25  600 

3398 

- 

The  Helmholtz  parameter  k  was  chosen 

9.0e-03 
1.9e— 02 
4.6e-02 
l.le-01 
2.9e-01 
(2.5e+01) 


1.4e— 07 
1.6e-07 


wavelength  (resulting  in  a  quadrature  error  about  10~12).  The  times  in  parenthesis  refer  to  experiments  that  did  not  fit  in  RAM. 


We  see  that  the  asymptotic  complexity  of  the  algorithm  remains  essentially  the  same  as  for  the  smooth 
contour  shown  in  Fig.  4.  However,  the  constants  involved  are  larger  since  more  degrees  of  freedom  are  re¬ 
quired  to  resolve  the  contour  at  the  finest  levels. 

6.3.  An  interior  problem  close  to  a  resonance 

In  this  section  we  present  results  pertaining  to  interior  Dirichlet  problem  on  the  contour  shown  in  Fig.  7. 
While  interior  and  exterior  Laplace  Dirichlet  problems  are  quite  similar  in  nature,  the  corresponding  Helm¬ 
holtz  Dirichlet  problems  are  fundamentally  different  in  that  the  interior  problem  possesses  resonances  while 
the  exterior  does  not.  We  will  therefore  focus  exclusively  on  interior  Helmholtz  problems. 

We  present  the  results  of  two  computational  experiments,  both  relating  to  the  Helmholtz  kernel  (6.3).  In 
the  first  experiment,  we  scan  a  range  of  wave  numbers  k  between  99.9  and  100.1.  For  each  wave  number,  we 
computed  the  smallest  singular  value  o-min  of  the  integral  operator  using  the  iteration  technique  described  in 
Section  4.5.  The  resulting  graph  of  ami„  versus  k,  shown  in  Fig.  8,  clearly  indicates  the  location  of  each  res¬ 
onance  in  this  interval.  The  second  experiment  consists  of  factoring  the  inverse  of  the  matrix  corresponding 
to  k  =  100.01 10276-  •  •  for  which  <rmin  =  0.00001366-  •  •.  The  results,  shown  in  Table  9,  illustrate  that  the 
method  does  not  experience  any  difficulty  in  factoring  the  inverse  of  a  reasonably  ill-conditioned  matrix. 
In  particular,  the  table  shows  that  the  factorization  matrices  B I®,  and  £>w,  see  (3.17),  are  well- 

conditioned. 


Fig.  7.  A  contour  the  shape  of  a  smooth  pentagram.  Its  diameter  is  2.5  and  its  length  is  roughly  8.3. 
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99.9  89.92  99.94  99.96  99.98  100  100.02  100.04  100.06  100.08  100.1 


Fig.  8.  Plot  of  <jm\n  versus  k  for  an  interior  Helmholtz  problem  on  the  contour  shown  in  Fig.  7.  The  values  shown  were  computed  using 
the  iteration  technique  of  Section  4.5  applied  to  a  matrix  of  size  N  =  6400.  Each  point  in  the  graph  required  about  60  s  of  CPU  time. 


Table  9 


Details  of  the  computation  for  the  Helmholtz  kernel  (6.3)  associated  with  an  interior  Dirichlet  problem  on  the  smooth  pentagram 
shown  in  Fig.  7  for  the  case  N  =  6400  and  k  =  100.01 1027569*  •  * 


j 

Pj 

nJ 

yj 

‘j 

l|C«tc 

Halloo 

m*. 

1 

128 

50.00 

0.76 

15.50 

1.12e+00 

1.12e+00 

4.20e— 02 

2 

64 

76.00 

0.59 

14.32 

3.27e+01 

3.27e+01 

1.75e+00 

3 

32 

89.72 

0.60 

8.94 

1.63e+01 

1.62e+01 

9.28e-01 

4 

16 

107.00 

0.64 

6.27 

9.09e+00 

9.17e+00 

2.41e+00 

5 

8 

138.00 

0.72 

5.97 

7.32e+00 

7.31e+00 

3.64e+00 

6 

4 

199.50 

0.80 

7.76 

3.22e+00 

3.23e-H)0 

3.86e+00 

For  each  level  j,  the  table  shows  the  number  of  clusters  pj  on  that  level,  the  average  size  of  a  cluster  nJy  the  compression  ratio  yj ,  the  time 
required  for  the  factorization  tj  and  the  size  of  the  matrices  B®y  C00  and  Z)w  (see  (3.17))  in  the  maximum  norm.  For  this  computation, 
£rcs  =  2.8  x  10" J0,  £pot  =  3.3  x  10"5  and  <7min  =  1.4 x  10"5. 


6.4.  A  contour  resembling  an  area  integral 

The  final  numerical  experiment  that  we  present  is  included  to  demonstrate  that  the  efficiency  of  the  fac¬ 
torization  scheme  deteriorates  when  it  is  applied  to  a  curve  for  which  the  physical  distance  between  two 
random  points  on  the  contour  is  not  well  predicted  by  their  physical  separation.  One  example  of  such  a 
curve  is  the  star-fish  lattice  illustrated  in  Fig.  9.  Focusing  on  the  double  layer  Laplace  problem  (6.1),  we 
apply  the  factorization  scheme  to  a  matrix  of  size  N  =  25  600  and  compare  the  performance  to  that  for 
the  rippled  dumb-bell  shown  in  Fig.  6.  Table  10  shows  that  the  factorization  of  the  matrix  related  to  the 
starfish  lattice  took  almost  five  times  as  long  and  resulted  in  a  compressed  matrix  of  over  twice  the  size. 

To  understand  the  difference  in  performance  between  the  different  contours,  we  need  to  consider  how  the 
interaction  rank  of  a  cluster  depends  on  its  size.  For  the  contours  shown  in  Figs.  4,  6,  and  7,  we  have  seen 
that  the  rank  of  the  interaction  between  a  cluster  of  size  m  and  the  rest  of  the  contour  is  effectively  bounded 
by  logm.  However,  for  the  contour  shown  in  Fig.  9  the  corresponding  bound  is  yfm.  Figs.  5  and  10  illus- 
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7h 


5b 


4b 


3b 


ib 


10 


Fig.  9.  The  star-fish  lattice  contour;  the  physical  distance  between  two  random  points  on  the  contour  is  not  well  predicted  by  their 
distance  along  the  contour. 


Table  10 

Test  results  for  two  experiments  concerning  the  matrix  obtained  by  discretizing  the  double  layer  Laplace  problem  (6.1) 


Contour: 

liot 

•^start 

•^final 

M 

Rippled  dumb-bell 

37s 

25  600 

559 

86Mb 

Star-fish  lattice 

172s 

25  600 

1202 

210Mb 

The  table  illustrates  the  difference  in  performance  of  the  algorithm  when  applied  to,  on  the  one  hand,  the  contour  shown  in  Fig.  6  (top 
line),  and  on  the  other  hand,  the  contour  shown  in  Fig.  9  (lower  line). 


Fig.  10.  Figure  (a)  shows  a  close-up  of  the  star-fish  lattice  of  Fig.  9.  Figure  (b)  shows  the  nodes  remaining  after  the  interaction  between 
the  cluster  formed  by  the  points  inside  the  parallelogram  and  the  remainder  of  the  contour  has  been  compressed,  cf.  Fig.  5. 
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trate  the  difference.  Thus,  the  asymptotic  complexity  of  the  scheme  when  applied  to  a  contour  similar  to  the 
star-fish  lattice  is  0(/rV2)  rather  than  O (n  log  n). 


7.  Generalizations  and  conclusions 

We  have  presented  a  numerical  scheme  that  constructs  a  compressed  factorization  of  the  inverse  of  a 
matrix.  The  scheme  is  applicable  to  generic  matrices  whose  off-diagonal  blocks  have  rank-deficiencies 
but  is  most  efficient  when  applied  to  matrices  arising  from  the  discretization  of  integral  equations  defined 
on  one-dimensional  contours.  (Although  such  integral  equations  frequently  arise  in  the  analysis  of  bound¬ 
ary  value  problems  in  two  dimensions,  the  dimension  of  the  underlying  space  is  of  little  relevance  to  the 
algorithm.)  For  equations  with  non-oscillatory  kernels  the  computational  complexity  of  the  algorithm  is 
0(/2  log*  n)  for  most  contours,  where  k  =  1  or  2,  and  n  is  the  number  of  nodes  in  the  discretization  of 
the  contour. 

Comparing  our  implementations  of  the  direct  factorization  scheme  on  the  one  hand  and  the  FMM  ma¬ 
trix-vector  multiplication  scheme  on  the  other,  we  observed  (i)  that  in  a  typical  environment,  the  cost  of 
constructing  a  factorization  of  the  inverse  is  15-20  times  larger  than  the  cost  of  a  single  FMM  matrix-vector 
multiply,  and  (ii)  that  once  the  factorization  of  the  inverse  has  been  computed,  the  cost  to  apply  it  to  a  vec¬ 
tor  is  5-10  times  smaller  than  the  cost  of  a  single  FMM  matrix-vector  multiply.  Thus,  if  an  iterative  solver 
requires  less  than  20  steps  to  converge,  the  iterative  solver  outperforms  the  direct  solver  for  a  single  solve. 
However,  if  multiple  right-hand  sides  are  involved,  the  direct  solver  has  a  clear  advantage.  This  observation 
is  the  foundation  for  [11]. 

Since  the  scheme  is  based  on  rank  considerations  only,  it  cannot  work  for  boundary  integral  equations 
involving  highly  oscillatory  kernels.  However,  since  the  interaction  ranks  are  determined  dynamically,  the 
oscillation  must  be  quite  significant  before  the  scheme  becomes  impracticable.  Empirically,  it  was  found 
that  the  scheme  remains  efficient  for  contours  a  couple  of  hundred  wavelengths  in  size. 

Another  limitation  of  the  scheme  is  that  it  does  not  achieve  optimal  efficiency  when  applied  to  a  bound¬ 
ary  integral  equation  set  on  either  a  contour  similar  to  the  one  shown  in  Fig.  9,  or  on  a  two-dimensional 
surface.  In  either  case,  its  computational  complexity  is  0(«3/2).  Overcoming  this  limitation  is  a  subject  of 
on-going  research. 

Finally,  we  mention  that  the  matrix  factorization  scheme  presented  in  this  paper  can  be  modified  to  con¬ 
struct  certain  standard  matrix  factorizations  (such  as  the  singular  value  decomposition).  This  modification 
will  be  reported  at  a  later  date. 
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Abstract 

A  stable  second  kind  integral  equation  formulation  has  been  developed  for  the  Dirichlet  problem  for  the  Laplace 
equation  in  two  dimensions,  with  the  boundary  conditions  specified  on  a  collection  of  open  curves.  The  performance  of 
the  obtained  apparatus  is  illustrated  with  several  numerical  examples. 

©  2003  Elsevier  Science  B.V.  All  rights  reserved. 

AMS:  65R10;  77C05 

Keywords:  Open  surface  problems;  Laplace  equation;  Finite  Hilbert  transform;  Second  kind  integral  equation;  Dirichlet  problem 


1.  Introduction 

Integral  equations  have  been  one  of  principal  tools  for  the  numerical  solution  of  scattering  problems  for 
more  than  30  years,  both  in  the  Helmholtz  and  Maxwell  environments.  Historically,  most  of  the  equations 
used  have  been  of  the  first  kind,  since  numerical  instabilities  associated  with  such  equations  have  not  been 
critically  important  for  the  relatively  small-scale  problems  that  could  be  handled  at  the  time. 

The  combination  of  improved  hardware  with  the  recent  progress  in  the  design  of  “fast”  algorithms  has 
changed  the  situation  dramatically.  Condition  numbers  of  systems  of  linear  algebraic  equations  resulting 
from  the  discretization  of  integral  equations  of  potential  theory  have  become  critical,  and  the  simplest  way 
to  limit  such  condition  numbers  is  by  starting  with  second  kind  integral  equations.  Hence,  interest  has 
increased  in  reducing  scattering  problems  to  systems  of  second  kind  integral  equations  on  the  boundaries  of 
the  scatterers. 

During  the  last  several  years,  satisfactory  integral  equation  formulations  have  been  constructed  in  both 
acoustic  (Helmholtz  equation)  and  electromagnetic  (Maxwell’s  equations)  environments,  whenever  all  of 
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the  scattering  surfaces  are  “closed”  (i.e.,  scatterers  have  well-defined  interiors,  and  have  no  infinitely  thin 
parts).  Boundary  value  problems  for  the  biharmonic  equation  with  boundary  data  specified  on  a  collection 
of  open  curves  have  been  investigated  in  some  detail  in  [9-11].  However,  a  stable  second  kind  integral 
equation  formulation  for  scattering  problems  involving  “open”  surfaces  does  not  appear  to  be  present  in 
the  literature. 

In  this  paper,  we  describe  a  stable  second  kind  integral  equation  formulation  for  the  Dirichlet  problem 
for  the  Laplace  equation  in  R2,  with  the  boundary  conditions  specified  on  an  “open”  curve.  We  start  with  a 
detailed  investigation  of  the  case  when  the  curve  in  question  is  the  interval  [-1,1]  on  the  real  axis;  then  we 
generalize  the  obtained  results  for  the  case  of  (reasonably)  general  open  curves. 

The  layout  of  the  paper  is  as  follows.  In  Section  2,  the  necessary  mathematical  and  numerical  prelim¬ 
inaries  are  introduced.  Section  3  contains  the  exact  statement  of  the  problem.  Section  4  contains  an  in¬ 
formal  description  of  the  procedure.  In  Sections  5  and  6,  we  investigate  the  cases  of  the  straight  line  segment 
and  of  the  general  sufficiently  smooth  curve,  respectively.  In  Section  7,  we  describe  a  simple  numerical 
implementation  of  the  scheme  described  in  Section  6.  The  performance  of  the  algorithm  is  illustrated  in 
Section  8  with  several  numerical  examples.  Finally,  in  Section  9  we  discuss  several  generalizations  of  the 
approach. 


2.  Analytical  preliminaries 

In  this  section,  we  summarize  several  results  from  classical  and  numerical  analysis  to  be  used  in  the 
remainder  of  the  paper.  Detailed  references  are  given  in  the  text. 

2.1.  Notation 

Suppose  that  a,  b  are  two  real  numbers  with  a  <  b,  and  f,g  :  [a,b\  — ►  €  is  a  pair  of  smooth  functions, 
and  that  on  the  interval  [a,  b],  the  function  g  has  a  single  simple  root  s.  Throughout  this  paper,  we  will  be 
repeatedly  encountering  expressions  of  the  form 


normally  referred  to  as  principal  value  integrals.  In  a  mild  abuse  of  notation,  we  will  refer  to  expressions  of 
the  form  (1)  simply  as  integrals.  We  will  also  be  fairly  cavalier  about  the  spaces  on  which  operators  of  the 
type  (1)  operate;  whenever  the  properties  (smoothness,  boundedness,  etc.)  required  from  a  function  are 
obvious  from  the  context,  their  exact  specifications  are  omitted. 

2.2.  Chebyshev  polynomials  and  Chebyshev  approximation 

Chebyshev  polynomials  are  frequently  encountered  in  numerical  analysis.  As  is  well  known,  Chebyshev 
polynomials  of  the  first  kind  T„  :  [-1, 1]  -+  U(n  >  0)  are  defined  by  the  formula 

Tn{x)  =  cos(«  arccos(x))  (2) 

and  are  orthogonal  with  respect  to  the  inner  product 

(/,  g)  =  J  ]  /(■*)  •  s(x)  ■  ^== dx- 


(3) 
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The  Chebyshev  nodes  x{  of  degree  N  are  the  zeros  of  7#  defined  by  the  formula 

(2/  -j-  1  )7C 


Xi  =  cos- 


2N 


i  =  0, 1 , . . . ,  AT  —  1 . 


Chebyshev  polynomials  of  the  second  kind  U„ :  [-1, 1] 


^  0)  are  defined  by  the  fonnula 


Unix)  = 


sin((w  +  1 )  arccos(x)) 
sin(arccos(x)) 


(4) 


(5) 


and  are  orthogonal  with  respect  to  the  inner  product 

( f,g )  =  J  fix)  ■  g(x)  ■  Vl  -  x2dx.  (6) 

The  Chebyshev  nodes  of  the  second  kind  tj  of  degree  N  are  the  zeros  of  Un  defined  by  the  formula 

tj  =  cos  j  =  (7) 

J  4- 1 

For  a  sufficiently  smooth  function  / :  [—1, 1]  — ♦  R,  its  Chebyshev  expansion  is  defined  by  the  formula 

/W  =  Ecrr*w,  (8) 


*=0 


with  the  coefficients  Ck  given  by  the  formulae 

—  I  £ 

II 

f7w-7o(*Ml-*T'/2d*, 

-1 

(9) 

and 

C*“;J 

f7(*)-7i(*Ml-xT,/2d*, 

-1 

(10) 

for  all  k  >  1.  We  will  also  denote  by  PNf  the  order  N  -  1  Chebyshev  approximation  to  the  function  /  on  the 
interval  [-1,1],  i.e.,  the  (unique)  polynomial  of  order  N—l  such  that  P^(x,)=/(x,)  for  all 
i  =  0, 1, . . .  ,N  -  1,  with  Xi  the  Chebyshev  nodes  defined  by  (4). 

The  following  lemma  provides  an  error  estimate  for  the  Chebyshev  approximation  (see,  for  example  [5]). 


Lemma  1.  If  f  e  C*[— 1, 1]  (i.e.,  f  has  k  continuous  derivatives  on  the  interval  [-1,1]),  then  for  any 

*€  [-1,1], 

|/y(,)-/M|=0(i;).  (11) 

In  particular,  iff  is  infinitely  differentiable,  then  the  Chebyshev  approximation  converges  superalgebraically 
{i.e.,  faster  than  any  finite  power  of  l /N  as  N — ►  oo). 


2.3.  The  finite  Hilbert  transform 

We  will  define  the  finite  Hilbert  transform  H  by  the  formula 


(12) 
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We  then  define  the  operator  K  :  C2[-l,  1]  — ►  Z,2(-oo,oo)  by  the  formula 


K(<p)(x)  =  lira 


(t-x) 


dr  + 


<p(‘)  df  2q>(x) 

it-*)2 


(13) 


and  observe  that  the  limit  (13)  is  often  referred  to  as  the  finite  part  integral 


(14) 


(see,  for  example  Hadamard  [8]). 

The  following  theorem  can  be  found  in  [13];  it  provides  sufficient  conditions  for  the  existence  of  the  finite 
part  integral  (14),  and  establishes  a  connection  between  the  finite  Hilbert  transform  (12)  and  the  finite  part 
integral. 

Theorem  2.  For  any  q>  e  C2[-l,  1],  the  limit  (13)  is  a  square-integrable  function  of  x.  Furthermore, 

K{cp)=D°H{q>),  (15) 

with  D  =  ~  the  differentiation  operator. 

The  following  theorem  (see,  for  example  [21])  describes  the  inverse  of  the  operator  H,  to  the  extent  that 
such  an  inverse  exists 

Theorem  3.  The  null  space  of  the  operator  H  is  spanned  by  the  function  1  /  \ZT-  x1.  Furthermore,  for  any 
function  f  €  1P\—  1, 1]  with  p  >  1,  all  solutions  of  the  equation 


H{(p)=f  (16) 

are  given  by  the  formula 

=  +  (17) 

with  C  an  arbitrary  constant,  and  the  operator  T  :  LP[—\,  1]  — ►  Z/>[ —  1 , 1]  defined  by  the  formula 

T(f)(x)  =  vr^-f(x).  (18) 


Applying  Theorem  3  twice,  we  immediately  obtain  the  following  corollary: 
Corollary  4.  For  any  f  e  C1  [-1, 1],  all  solutions  of  the  equation 
HoH((p)  =  H2{q>)=f 
are  given  by  the  formula 

*) =y]°*2°  +wh' +  Tra  •  logB 


(19) 

(20) 


with  C0,  Ci  two  arbitrary  constants . 
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2.4.  Several  elementary  identities 


In  this  section,  we  collect  several  identities  from  classical  analysis  to  be  used  in  the  remainder  of  the  paper. 
Lemma  5  states  a  well-known  fact  about  the  two-dimensional  Poisson  kernel  y/(x?  +  y2);  it  can  be  found  in 
(for  example)  [19].  Lemma  6  provides  explicit  expressions  for  the  finite  Hilbert  transform  operating  on 
Chebyshev  polynomials,  where  (22)  is  a  direct  consequence  of  Lemma  3,  and  (23),  (24)  can  be  found  in  [2]. 
Lemma  7  lists  several  standard  definite  integrals;  all  can  be  found  (in  a  somewhat  different  form)  in  [6].  Fi¬ 
nally,  Lemma  8  follows  from  the  definition  of  curvature  c(t)  found  in  elementary  differential  geometry  (cf.  [3]). 


Lemma  5.  Suppose  that  a  £  LP[- 1, 1]  (p  ^  1).  Then 


lim 

y — >0 


H 

rc((x  - /)2 +/) 


•  <x(f)  dt  =  o(x), 


for  almost  all  x  £  [—1,1]. 


(21) 


Lemma  6.  For  any  x  €  (—1,1), 


1 


dt  =  0, 


(22) 


and 


i: 

i: 


t-x 


■U^(t)dt=-n-T„(x), 


1 

t-x 


-j=-Tn{t)dt  =  n-Un.\{x\ 


for  any  n  ^  1. 


(23) 

(24) 


Lemma  7. 

1.  For  any  x,  t  €  (—1, 1)  andx  ^  t. 


['  1  .  log^-lQg^f 

L  {s-x){s-t)as 


x-t 


2.  For  any  (x,y)  £  R2  \  [-1, 1]  and  t  £  (— 1, 1), 

j  w(arctan(Jf)+arctan(W)) 

dj  — —  "  "  ■ 


/; 


((s  -  xf  +  /)(5  -  t)  ((x  -tf+y1) 

<*-»>•  (lot^.-y^) 


3.  For  any  x  £  (—  1, 1), 


/; 


log  \x  -  /| 


+■ 


dt  =  -tc  •  log  2, 


2((x-tf+f ) 


(25) 


(26) 


(27) 
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z*1  / 

J  '  log  \x-t\-  ~^= ___  df  =  -n  X, 


/'  1  1  ,  „  n-arccos* 

i_.__.1°g(1  +  ,)d?  = 


x 

l  l 

*  Vl  -t2 


log(l  -t)6t  = 


7C  *  (arccos(x)  -  n) 
Vl  —  x2 


f  — _ — --log(l  +t)dt  =  n-  (arccos(jc)  ■  Vl  -x2  +  log(2)  -x-  1), 

7-i  t~x 


f  — \ — L .  i0g(i  -  t)dt  =  7t  •  ((arccos(x)  -  7t)  •  Vl  -x2  +  log(2)  -x-f  1). 
7-i  *  ~  x 


(28) 

(29) 

(30) 

(31) 

(32) 


Lemma  8.  Suppose  that  y  :  [0,L]  — >  IR2  is  a  sufficiently  smooth  curve  parametrized  by  its  arc  length  with  the 
unit  normal  and  the  unit  tangent  vectors  at  y(t)  denoted  by  N(t)  and  T(t),  respectively .  Suppose  further  that  the 
function  u  :  (R2  — >  ER  is  twice  continuously  differentiable.  Then  at  the  point  y(t ),  the  Laplacian  of  u  is  given  by 
the  formula 


j2  o2 

Au  =  N  •  Wu  •  N  -  c(t)N  •  Vw  + 

d  t2  dN(t)2 


c(i) 


8u  d2u 
dN(t)  +  dT(t)2  ’ 


where  the  curvature  c{t)  at  y(t)  is  defined  by  d2y/d/2  =  c(t)N(t). 


(33) 


2.5.  The  Poincar  e-Bertrand  formula 


For  a  fixed  point  x  £  (—1, 1),  we  will  consider  two  repeated  integrals 

*i 


ds, 


(34) 

(35) 


differing  from  each  other  only  in  the  order  of  integration.  Both  integrals  exist  almost  everywhere  for  a  fairly 
broad  class  of  functions.  However,  they  are  not,  in  general,  equal  to  one  another.  The  following  lemma 
establishes  the  connection  between  them  (see,  for  example  [17,21];  the  result  is  usually  referred  to  as  the 
Poincare-Bertrand  formula. 


Lemma  9.  Suppose  that  <px  £  LP[- 1, 1],  (p2  €  L*[-l,  1].  Then  if 

1  +  1  <1,  ’  (36) 

p  q 


then 
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LtB  (Lt=7 <k)d'  =  W ■  +  /, *<’> '  (/,  (7 7) d') ^  <37> 

for  almost  all  x  €  (—1,1). 

2.6.  Potential  theory 

In  this  section,  we  introduce  some  terminology  standard  in  potential  theory  and  state  several  technical 
lemmas  to  be  used  subsequently.  We  will  define  the  potential  GXo  :  U2  \  {*o}  ->  U  of  a  imit  charge  located  at 
the  point  E  U2  by  the  formula 

GXo(x)  =  log(  ||x  -  x0||).  (38) 

Suppose  that  y  :  [0,i]  — >  IF52  is  a  sufficiently  smooth  curve  parametrized  by  its  arc  length,  and  that  y  is  an 
open  curve  (i.e.,  y(0)  ^  y(L)).  The  image  of  y  will  be  denoted  by  f,  and  the  unit  normal  and  the  unit  tangent 
vectors  to  y  at  the  point  y(0  will  be  denoted  by  N(t)  and  T(t),  respectively.  Given  an  integrable  function 
a  :  [0,  L }  — >  R,  we  will  refer  to  the  functions  Sy<a  :R2-*(R  and  Dy<a,  Qy  „  :  R2  \  T  —*■  R,  defined  by  the  formulae 


Sy,a{x)  —  l  Gy(t)(x)  • 
Jo 

Dy'a{x)  =  l  dm 


(X)= 

7  Jo  dN(t)2 


<r(t)dt, 


as  the  single,  double,  and  quadruple  layer  potentials,  respectively. 

The  functions  (3 Gy{l)(x))/(dN(t)),  (d2Gy[i)(x))/(dN(t)2) :  R2  \  y(t) 
and  quadrupole  potentials,  respectively.  Obviously, 

QgrwM  (N(t),x-y(t)) 

dN(,)  ||x-y(r)||2  ’ 


:  are  often  referred  to  as  the  dipole 


9  2Gm(x)_  2{N(t),x  —  y(Q)2 

3A^(02  Ik -y  (Oil4 


Ik -y  (Oil2 


In  particular,  if  y  is  a  straight  line  segment  IL  —  [0,1]  on  the  real  axis,  then 

3  G,w(I(s)-h-N(s))  h 
3 N(s  +  0  h2  +  t2' 


#G,wm-h-N(s))  f-h2 

dN(s  +  02  {h2  +  t2f  1  ' 

The  following  two  lemmas  can  be  found  in  [14].  Lemma  10  states  a  standard  fact  from  elementary 
differential  geometry  of  curves;  Lemma  1 1  describes  the  local  behavior  on  a  curve  of  the  potential  of  a 
quadrupole  located  on  that  curve  and  oriented  normally  to  it. 
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Lemma  10.  Suppose  that  y  :  [0,L]  -+  IR2  is  a  sufficiently  smooth  curve  parametrized  by  its  arc  length  with  the 
unit  normal  and  the  unit  tangent  vectors  at  y(t)  denoted  by  N{t)  and  T(t),  respectively.  Then,  there  exist  a 
positive  real  number  [i  (dependent  on  y),  and  two  continuously  differentiable  functions  f,  g  :  (-/?,  (!)  — *•  1 
(dependent  on  y),  such  that  for  any  t  €  [0,1], 

y(t  +  s)  -  y(t)  =  (,-C(06;*  +  s4  •/(*))  •  T(t)  +  ^^-  +  s3  ■  g(sfj  ■ N(t ),  (46) 

for  all  s  €  (~P,P),  where  c(t)  in  (46)  is  the  curvature  of  y  at  the  point  y(t). 

Lemma  11.  Suppose  that  y :  [0,1]  -+  R2  is  a  sufficiently  smooth  curve  parametrized  by  its  arc  length.  Then, 
there  exist  real  positive  numbers  A,  P,  ho  such  that  for  any  s  €  [0,  L), 

9  2GW+«)(r(-g)  ~  h  ■  N(s))  t1  -h2  c-h-t2- (5h2  +  t2)  . 

QN(s  +  t)2  (h2  +  t2)2  ( h2+t 2f  "  ’  {  ’ 

for  all  t  €  (-/?,/?),  0</;  <  h0>  where  the  coefficient  c  in  (47)  is  the  positive  curvature  of  y  at  the  point  y(s). 

Similarly,  the  following  lemma  describes  the  local  behavior  on  a  curve  of  the  potential  of  a  dipole  located 
on  that  curve  and  oriented  normally  to  it;  it  also  describes  the  local  behavior  on  a  curve  of  the  tangential 
derivative  of  the  potential  of  a  charge  located  on  that  curve.  Its  proof  is  virtually  identical  to  that  of  Lemma 
11. 

Lemma  12.  Under  the  conditions  of  Lemma  11,  there  exist  real  positive  numbers  A,  /?,  h0  such  that  for  any 
s  e  [0 ,L), 


dGy(s+l)(y(s)  -  h  ■  N(s)) 

h 

<A, 

dN(s  +  t) 

V  +  fi 

QGy(s+l)(y(s)  -  h  ■  N(s)) 

t 

< A , 

dT(s  +  t) 

h2  +  t2 

(48) 

(49) 


for  all  t  £  ( — P ,  P),  0^h<  ho . 

We  will  define  the  function  :  IR2  \  T  — » IR  by  the  formula 

My,a(x)  =  Qy,„(x)  -  Dh„(x)  =  ^  ^  ~  '  dN(t)  ~ )  '  d/’ 

for  all  x  e  IR2  \  r  and  observe  that  is  the  difference  of  a  quadruple  layer  potential  and  a  weighted 
double  layer  potential  with  the  weight  equal  to  the  curvature  c(t).  The  following  theorem  is  a  direct 
consequence  of  Lemmas  1 1  and  12;  it  states  that  under  certain  conditions  the  function  A/7iff  defined  by  (50) 
can  be  continuously  extended  to  the  whole  plane  U2. 


Theorem  13.  Suppose  that  y  :  [0,  L]  — ►  U2  is  a  sufficiently  smooth  open  curve  parametrized  by  its  arc  length , 
and  that  a  :  [0,L]  — ►  U  is  a  function  continuous  on  [0,L],  whose  second  derivative  is  continuous  on  (0,1,).  Then 
the  function  My,a  can  be  continuously  extended  to  IR2  \  {y(0),  y(L)}  with  the  limiting  value  on  y(0 ,1)  defined  by 
the  formula 
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My,'{y{x))  -  f.p. 


fL  d2Gy{l)(y(x)) 
o  dN{t)2 


8Gy(,)(y(x)) 

dN(t) 


■  <r(f)  d/ 


(51) 


for  all  x  £  (0,Z).  Furthermore ,  if  a  satisfies  the  additional  condition  that 

K*)|<C.(*.(I-jc))\  (52) 

wit/;  some  C  >  0,  a  >  1  /or  a//x  €  [0,1],  then  My>„  can  be  further  continuously  extended  to  IR2  with  the  limiting 
values  on  y(0),  y(L)  given  by  the  improper  integrals 


M 


dN(t)2 

&Gh  o(y(L)) 
8 N(t)2 


~  c(t)  • 


8Gy«)(y(0)) 

dN{t) 


8iv(0 


(53) 

(54) 


respectively. 


Definition  14.  We  will  denote  by  E  the  linear  subspace  of  C[0,L],  consisting  of  functions  a  satisfying  the 
following  two  conditions: 

1.  a  is  twice  continuously  differentiable  on  (0,1); 

2.  a  satisfies  the  condition  (52). 


We  then  define  the  integral  operator  My :  E  — >  C[0,L]  via  the  formula 

My(ff)(x)  -  My,a(y{x)).  (55) 

The  following  lemma  states  that  the  operator  My  on  a  sufficiently  smooth  open  curve  y  is  a  compact 
perturbation  of  the  same  operator  MIl  on  the  line  segment  IL  =  [0,1]. 


Lemma  15.  Suppose  that  y  :  [0,1]  — ►  R2  is  a  sufficiently  smooth  open  curve  parametrized  by  its  arc  length. 
Suppose  further  that  the  operator  Ry  :  C[0,L]  C[0,L]  is  defined  by  the  formula 


Ry{o){x)=  f  r(x,t)-c(t)dt 
Jo 

with  the  function  r  :  [0,1]  x  [0,L]  — ►  IR  defined  by  the  formula 
,  A  &G.i[t){y{x))  8Gy(,)(y(jr))  82G4(()(x) 

r{X']~  dN(t)2  W'  mo  dN(t)2 

for  all  x  ft,  and  by  the  formula 

c(0 2 


r(t,0 


12 


for  all  x  =  t,  with  c(t)  denoting  the  curvature  of  y  at  the  point  y(t).  Then 

1  ,  . 

1  llv(*)-yMII2 


2<W(t),y(*)-y(Q)2 

lly(*)  -yWII4  llyW-y(0ll 


,  /a  (N(t),y(x)-y{t)) 
2+ci0 . — - 77775 — 


1 


(x-t)2 


(56) 


(57) 


(58) 


(59) 


for  allx  f  t.  Furthermore r  is  continuous  on  [0,L]  x  [0,£],  so  that  the  operator  Ry  is  compact  Finally ,  if  a  €  E 
(see  Definition  14  above),  then 


S .  Jiang,  V.  Rokhlin  /  Journal  of  Computational  Physics  191  (2003)  40-74 


49 


My(a)(x)  =  M,,(a)(x)  +  Ry(ff)(x). 


(60) 


Proof.  Eq.  (60)  follows  directly  from  the  combination  of  (51),  (56),  (57)  and  the  fact  that  the  curvature  is 
zero  everywhere  on  the  line  segment  h.  (59)  is  a  direct  consequence  of  (42),  (43),  (45),  (57).  In  order  to  prove 
that  r  is  continuous  on  [0,1]  x  [0,1],  we  start  with  observing  that  since  y  £  C2[0,L],  it  is  sufficient  to 
demonstrate  that 


1 


c(t)2 

lim/-(r  +  5,  /)=——. 

s— »0  IZ 

Replacing  *  in  (59)  with  t  +  s,  we  obtain 

r(t  +  s  t)  =  2W0.y(*  +  j)-y(Q)2  |  . . 

1  '  ||y(r  +  s)-y(/)ir  fly(f  +  *) -7(011 

Substituting  (46)  into  (62),  we  have 

A  W*)2  .  ./  x  j>(0 . 1  ~  d(s) 
r(t  +  s,t)~  +  c(0  •  +  s2  d(s) , 

where  the  functions  p,d :  — >  R  are  given  be  the  formulae 

p(0  =  ^r+5-s(5), 


,  r  %  W0,y(f+*)-y(0)  1 

1  U'  ||7(/  +  5)-7(0l|2 


(61) 


(62) 


(63) 


(64) 


d(s)=  +  ./(,))  +  (^  +  .2-^))  , 


(65) 


with  P  a  positive  real  number,  and  the  functions  /,  g  provided  by  Lemma  10.  Since  /,  g  are  continuously 
differentiable  on  (-/?,/?)  (see  Lemma  10),  we  have 


p(s)  c(t) 

™  d(s)  2  ’ 


lim 


1  -d(s)  c(t)2 


7^o  s2-d{s)  12  ' 

Now,  we  obtain  (61)  by  substituting  (66),  (67)  into  (63).  □ 


(66) 

(67) 


Remark  16.  A  somewhat  involved  analysis  shows  that  for  any  k  ^  1  and  y  £  C*+2[0,i],  the  function  r  (see 
(57)  above)  is  k  times  continuously  differentiable.  The  proof  of  this  fact  is  technical,  and  the  fact  itself  is 
peripheral  to  the  purpose  of  this  paper;  thus,  the  proof  is  omitted. 


3.  The  exact  statement  of  the  problem 

Suppose  that  y  is  a  sufficiently  smooth  open  curve,  and  that  the  image  of  y  is  denoted  by  F .  We  will  denote 
by  Sy  the  set  of  continuous  functions  on  R2  with  continuous  second  derivatives  in  the  complement  of  T,  i.e., 

sy  =  c2(r2  \  r)  n  c(R2).  (68) 
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We  will  consider  the  Dirichlet  problem  for  the  Laplace  equation  in  R2,  with  the  boundary  conditions 
specified  on  y: 

Given  a  function  /  :  T  — >  R,  find  a  bounded  solution  u  e  Sy  to  the  Laplace  equation 

Am  =  0  in  R2  \  T  (69) 

satisfying  the  Dirichlet  boundary  condition 

u=f  on  r.  (70) 

The  following  theorem  can  be  found  in  [15]. 

Theorem  17.  If  f  €  C2(T),  then  there  exists  a  unique  bounded  solution  in  Sy  to  the  problem  (69)  and  (70). 

Remark  18.  Certain  physical  problems  lead  to  modifications  of  the  problem  (69)  and  (70).  For  example,  the 
boundedness  of  the  solution  at  infinity  might  be  replaced  with  logarithmic  growth,  the  boundary  might 
consist  of  several  disjoint  components,  etc.  Extensions  of  Theorem  17  to  these  cases  are  straightforward, 
and  can  be  found,  for  example,  in  [17]. 


4.  Analytical  apparatus  I:  informal  description 

In  this  section,  we  will  present  an  informal  description  of  the  procedure.  We  assume  that  y  :  [—  1, 1]  — ►  R2 
is  a  sufficiently  smooth  “open”  (i.e.,  y(-l)  ^  y(l))  curve  with  the  parametrization 

y(0  =  y(§' (*+!))>  (71) 

where  L  is  the  total  arc  length  of  the  curve,  and  y  :  [0,L]  — >  U2  is  the  same  curve  parametrized  by  its  arc 
length.  The  image  of  y  will  be  denoted  by  T.  We  start  with  observing  that  the  solution  u  of  the  Dirichlet 
problem  (69)  and  (70)  must  satisfy  the  following  four  conditions: 

(a)  u  is  harmonic  in  U2  \  T; 

(b)  u  is  bounded  at  infinity; 

(c)  u  is  continuous  across  T; 

(d)  u  is  equal  to  the  prescribed  data  /  on  J\ 

Our  goal  is  to  construct  a  second  kind  integral  formulation  for  the  Dirichlet  problem  (69)  and  (70). 
Standard  approaches  in  classical  potential  theory  call  for  representing  u  in  IR2  \  T  via  single  or  double 
layer  potentials  so  that  conditions  (a),  (b)  are  automatically  satisfied,  and  conditions  (c),  (d)  lead  to  a 
boundary  integral  equation  via  the  so-called  jump  relations  of  single  and  double  layer  potentials  (see,  for 
example  [16]).  However,  in  the  case  of  an  open  curve,  if  u  is  represented  via  a  double  layer  potential,  the 
condition  (c)  cannot  be  satisfied  since  any  non-zero  double  layer  potential  has  a  jump  across  the 
boundary;  and  if  u  is  represented  via  a  single  layer  potential,  while  the  single  layer  potential  can  be 
continuously  extended  across  the  boundary,  the  condition  (d)  will  lead  to  an  integral  equation  of  th e  first 
kind.  Hence,  classical  tools  of  potential  theory  turn  out  to  be  insufficient  for  dealing  with  open  surface 
problems. 

It  is  shown  in  [14]  that  the  quadruple  layer  potential  has  a  jump  across  the  boundary  which  is  pro¬ 
portional  to  the  curvature  of  the  curve.  Combining  this  observation  with  the  well-known  fact  that  the 
double  layer  potential  has  a  jump  across  the  boundary  which  is  independent  of  the  curvature,  we  observe 
that  the  sum  of  a  quadruple  layer  potential  and  a  weighted  double  layer  potential  with  the  weight  equal  to 
the  curvature  given  by  the  formula 
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/'  ( 82(7y(')^  -  c(t)  ■  S~T-)  ■  ff(t) dt 

J- 1  V  3^(0  dN(‘)  J 


(72) 


can  be  continuously  extended  across  the  boundary.  However,  if  u  is  represented  via  (72),  then  the  condition 
(d)  will  lead  to  a  hypersingular  integral  equation.  It  is  also  shown  in  [14]  that  the  product  of  the  hyper¬ 
singular  integral  operator  with  the  single  layer  potential  operator  is  a  second  kind  integral  operator  in  the 
case  of  a  closed  boundary.  Thus,  one  is  naturally  lead  to  consider  the  operator  Py  defined  by  the  formula 


» = /’,  ( w ' c(,) '  ?§$r)  ‘  if', 108 "  - - 51 ' 


Obviously,  Py{a)  is  not  defined  when  xe  T,  and  we  will  define  the  operator  By  by  the  formula 
By(<r)(t)  =  lim  Py(a){x). 


x-+y(t) 


In  the  special  case  when  y  is  the  interval  /  =  [-1, 1]  on  the  real  axis,  (73)  assumes  the  form 
P,((r){x,y)  =  \  J  ^  log((x  -  s)2  log  \s  -  t\  ■  ds, 

and  the  operator  5/  is  defined  by  the  formula 


B,(<j)(x)  =  limP;(cr)(jc,y). 


(73) 


(74) 


(75) 


(76) 


The  operator  B,  turns  out  to  have  a  remarkably  simple  analytical  structure  (see  Section  5.4  below);  its 
natural  domain  consists  of  functions  of  the  form 


1 


•  <p(x)  + 


i 


lOgT-—! K*)> 


vT^3?  6i-^ 


(77) 


with  cp,  ip  smooth  functions,  and  when  restricted  to  functions  of  the  form  (77),  it  has  a  null-space  of  di¬ 
mension  2,  spanned  by  the  functions 


1 


(78) 


1 

Vl  -  x2 


•log 


l+x 

1  —  X  * 


(79) 


In  Section  5.4,  we  construct  a  generalized  (in  the  appropriate  sense)  inverse  of  Bp,  in  a  mild  abuse  of 
notation,  we  will  refer  to  it  as  By1. 

Now,  if  we  represent  the  solution  of  the  Problem  (69)  and  (70)  in  the  form 


u(x)  =  Py(o)(x), 


(80) 


then  the  conditions  (c)  and  (d)  will  lead  to  the  equation 

By(a)(t)  (81) 

with  a  the  unknown  density.  It  turns  out  that  (81)  behaves  almost  like  an  integral  equation  of  the  second 
kind;  the  only  problem  is  that  the  kernel  of  By  is  strongly  singular  at  the  ends.  Fortunately,  the  operator 
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By  =  By  O  BJ',  (82) 

restricted  to  smooth  functions,  is  a  sum  of  the  identity  and  a  compact  operator.  In  other  words,  By  is  a 
second  kind  integral  operator.  Therefore,  our  representation  for  the  solution  of  the  Problem  (69)  and  (70) 
takes  the  form 

u{x)  =  Py{tl)(x)  =  Py  O  Bp (t])(x),  (83) 

with  t]  the  solution  of  the  integral  equation 

By(nm=f(t).  (84) 

Finally,  we  remark  that  minor  complications  arise  from  the  non-uniqueness  of  Bp  (see  (78)  and  (79) 
above);  they  arc  resolved  in  Section  6.3. 


5.  Analytical  apparatus  II:  open  surface  problem  for  the  line  segment  /=  [—1,1] 

5.1.  The  integral  operator  P, 

Definition  19.  We  will  denote  by  Fj  the  set  of  functions  o :  (-1, 1)  — *  R  of  the  form 


<r(x)  = 


1 


Vl  -x2 


<p(x)  + 


1 


vr 


K*). 


(85) 


(86) 

(87) 


with  :  [-1, 1]  — » IR  twice  continuously  differentiable,  and  satisfying  the  conditions 

J  log |1  +  f|  •  o{t)dt  =  0, 

J  log |1  -  t\  ■  o(t)dt  =  0. 

We  will  consider  the  integral  operator  Pi :  Fi  — >  C2(R2  \7)  defined  by  the  formula 

P,(a)(x,y)  =  J  K,{x,y,  t )  •  o{t)dt  =  \  J  ^  Io8((*  “  sf  •  (J ^  log  I*  “  'I  *  d')  ds-  (88) 

Obviously,  Pi  converts  a  function  0  €  fy  into  a  quadruple  layer  potential  whose  density  D(o)  is  in  turn 
represented  by  a  single  layer  potential 


D(o)(x)  =  J  log  \x  — 1\  •  o(t) dr. 


(89) 


The  following  lemma  provides  an  explicit  expression  for  the  kernel  Ki  of  Ph 
Lemma  20.  For  any  a  e  /y, 


Kj(x,y,t )  = 


\y\  -  (arctan  (±^)  +  arctan  (x-t)-  (logoff  -  log-(-‘g^) 

—  —  —  |  ~~  3  —  ; 


((x-tf+y2) 


2  ((x-t)2+y>) 


(90) 


for  any  (x,y)  eU2\I  and  any  t  €  (-1,1). 
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Proof.  Since  log((x  -  j)2  +  y2)  satisfies  the  Laplace  equation  for  any  (x,y)  £  (s,  0),  we  have 
log((jc  -  s)2  +  /)  =  log((x  -  s)2  +/); 

substituting  (91)  into  (88)  and  integrating  by  parts  once,  we  obtain 

P,{o){x,y)  =  \J^S  1°S((*  —  s)2  +/)  •  (J ^  ^  l°gk-  *\  ■  & 

- 7^77 '  L 106 11  - ■  °m  -  <771777  •  L '°8 " + (| '  °m- 


(jc-  i y+y  u  '  w  (x+iy+y  u 

Combining  (92)  with  (86),  (87)  and  changing  the  order  of  integration,  we  have 

P/(<r)(x,y)  =  J  (jJ  log((j  -  xf  +  /)  •  ^  log  \s  - 1\  dsj  •  <r(f)  dt. 


Hence, 


Kj(x,y, t)=^  /  ®  log((«  -  x)2  +/)  •  ^  log  \s-t\ds  =  J- 


1  (s-x) 

x((s-x)2+y>)(s-t) 


Now,  (90)  follows  immediately  from  the  combination  of  (26)  and  (94).  □ 

5.2.  The  boundary  integral  operator  Bj 

We  will  define  the  integral  operator  Bt :  Fj  — *  L'[-l,  1]  (see  (85))  by  the  formula 

Bj(<t)(x)  =  limP/(ff)(^,y)  =  lim  f  K,(x,y,t)  ■  <r(t)dt. 
y- o  y-> o  J- 1 

The  following  lemma  provides  an  explicit  expression  for  Bh 
Lemma  21.  For  any  x  €  (-1, 1), 


B,(o)(x)  =  n2  ■  c(x)  +  J 


1  logj^-logi^ 


( j(t )  d  t. 


Proof.  Substituting  (90)  into  (95),  we  obtain 


r  \y\  • 

B,{o){x)  =  lim  f  — 


\y\  ■  (arctan  +  arctan 


«*-0z+J'2) 


lim  [ 

0  J_ 


i  (»-.)■  (logggy. 


2«> -«)'+>*) 


o(t)  d  t 


■  a(t)  dr. 
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Combining  (21)  with  the  trivial  identity 

lim  arctan  (-7-7-^  +  arctan  (-7^ )  =  7t,  xe  (-1, 1), 

r*  \  \y\  J  V  M  / 


(98) 


we  have 


fl  \y\  • 

lim  /  - 

0  J-i 


jyl  •  (arctan  (g)  +  arctan  (g)) 


ff{t)dt  =  n2-(r(x).  (99) 

-1  ((x-t)  +r) 

Now,  applying  Lebesgue’s  dominated  convergence  theorem  (see,  for  example  [18])  to  the  second  part  of  the 
right-hand  side  of  (97),  we  have 

2  ((x-tf+f) 

log 7^7  —  log 7^7 


lim 

y 


■ff(t)dt 


3  L 


T(t)dt  =  J_t 


x  —  t 


2  ((x-tf+f) 

Finally,  combining  (99),  (100)  with  (97),  we  obtain  (96).  □ 

Remark  22.  Elementary  analysis  shows  that 

liiulog£-log£f  1 _ l_= _ 2_ 

/-a:  X  —  t  \  —X  1+Jt  1  —  X2’ 

That  is,  the  only  singularities  of  the  integral  kernel  in  (96)  are  at  the  end  points  ±1. 


a (t) d t. 


(100) 


(101) 


5.3.  Connection  between  the  operator  Bj  and  the  finite  Hilbert  transform 


Lemma  23.  For  any  o  €  Fj  ( see  Definition  19), 

B,(c)(x)  =  -H2(<j)(x)  (102) 

for  all  x  €  (-1, 1). 

Proof.  Due  to  (12), 

»»»-/[  “  (/i  7^;  •<’«*)*•  003) 

Combining  (37)  with  (103),  we  have 

S1(,)(.)  =  -(.‘..(<)+/|  (/,(,-,)(, -,)dj)'‘r(,)d')-  <104) 

Now,  (102)  follows  immediately  from  the  combination  of  (25),  (96),  (104).  □ 

5.4.  The  inverse  of  H2  for  Chebyshev  polynomials 


In  Section  5.5,  we  will  need  the  ability  to  solve  equations  of  the  form  (19).  However,  due  to  Corollary  4, 
the  solution  to  (19)  is  not  unique.  The  purpose  of  this  section  is  Theorem  28,  stating  that  the  solution  to  (19) 
is  unique  if  restricted  to  the  function  space  F/  (see  Definition  19),  and  constructing  such  a  solution. 
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The  following  lemma  is  a  direct  consequence  of  Corollary  4  and  Lemma  6. 
Lemma  24.  For  any  integer  n  >  0  and  x  e  (-1, 1),  all  solutions  of  the  equation 
H\cn)  =  T„ 
are  given  by  the  formula 

/  \  ~  /  \  C0  C\  l+jc 

ffn(x)  =  (Tn(x)  +  /===  + 


6i-*’ 

tWfA  Co,  Ci  arbitrary  constants,  and  the  functions  o„  defined  by  the  formulae: 


oo(x)  = 


1 


log 


1+* 


n3  &i-x' 


and 


W  7t3  J-\  t-X 

-  ,  ,  1  R - 7  /■'  ^  2 


(105) 

(106) 

(107) 

(108) 
(109) 


l)rc3  Vl  —  JC2  ’ 

for  all  1. 

We  will  define  the  operators  J,L  :  C'[— 1, 1]  — ►  C[-l,  1]  via  the  formulae: 

JW)(X)  =  /j  log  7=  ■  (/]  S  <*)  <*. 

L{(p)(x)  =  j  ^  log  \x-t\-  Vl  -f2  •  ^  dyj  dt. 

The  following  lemma  provides  explicit  expressions  for  the  derivatives  of  J(<p),  L((p),  and  for  the  values  of 
J(cp),  L{(p)  at  the  points  -1,  1. 


(110) 


(111) 


Lemma  25.  For  any  tp  €  C'[—  1, 1], 


J'(cp)(x)  =  -n2  ■■ 


(p(x) 


L'((p){x)  =  -n2  ■  (p(x)  ■  \/l  -x2  +  n  J  <P(5)<k , 
for  any  x  e  (-1, 1),  and 

J(cp)(-l)  =  n-  J ^  -y(j)dj, 


(112) 

(113) 

(114) 
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L((p)(-l)  —  it  ■  J  cp{x)  •  (arccos(jc)  •  V\  -*2  +  log(2)  •  x  -  1) dx. 
L{(p){  1)  =  n  J  9(x)  ‘  ((arccos(x)  -  7t)  •  Vl  -x2  +  log(2)  •*  +  l)dx. 


(115) 

(116) 
(117) 


Proof.  The  identities  (1 14)— (1 17)  are  a  direct  consequence  of  (29)— (32)  in  Lemma  7,  respectively.  In  order  to 
prove  (112),  substituting  (1 10)  into  J'(<p)  and  interchanging  the  order  of  the  differentiation  and  integration, 
we  obtain 

(ns) 

Applying  (37)  to  the  right-hand  side  of  (118),  we  have 

-L&VM-Th*)*-  (1I9) 

Now,  (112)  follows  immediately  from  the  combination  of  (22),  (119).  The  proof  of  (113)  is  virtually 
identical  to  that  of  (112).  □ 

The  following  lemma  provides  explicit  expressions  for  J(T„),  with  n  =  0, 1, 2, . . . 


Lemma  26.  For  any  x  €  [-1, 1], 


J(T0)(x)  =  -  y  +  7i2  •  arccos(x) 


and 


J{T2n){x)=Yn'/V-^-U2n-x(x)> 


for  all  n  >  1. 

Proof.  Substituting  T0  into  the  Eqs.  (112)  and  (114),  we  obtain 

J'(ro)(t)dt  =  -^L, 


(120) 

(121) 

(122) 


(123) 

(124) 
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Now,  (120)  follows  immediately  from  the  combination  of  (123)  and  (124),  and  the  trivial  identity 


J(T0)(x)  =  J(T0)(-l)  +  J*S(T0)(t)dt. 


The  proofs  of  (121),  (122)  are  virtually  identical  to  the  proof  of  (120).  □ 

The  following  lemma  provides  explicit  expressions  for  L(U„),  with  n  =  0, 1 , 2, ...  It  is  a  direct  analogue  of 
Lemma  26,  replacing  the  mapping  J  with  the  mapping  L,  and  the  polynomials  T„  with  the  polynomials  U„. 
Its  proof  is  virtually  identical  to  that  of  Lemma  26. 

Lemma  27.  For  any  xe  [-1,1], 

2  3 

L(Uq)(x)  =  y  ■  (arccosx  -x  ■  Vl  -x2)  +  2n  x  -y  (126) 


l(U2„)(x)  =  j-VT=7i-(- 


Cfr-.W  U2nMV\  ,  2n 
2 n  2n  -f  2  )  2n  +  1  5 


r/rr  \ /  ^  ^  r - 5-  (U*,- i(x)  t/2„(x)'\  (2n\og2  4 n 


for  all  n  >  1. 


We  are  now  in  a  position  to  combine  the  identities  (27)  and  (28),  Lemmas  24,  26  and  27  to  obtain  a 
refined  version  of  Lemma  24.  The  following  theorem  is  one  of  principal  analytical  tools  of  this  paper. 

Theorem  28.  Suppose  that  for  each  n  =  0, 1 , 2, . . the  function  on  €  Fj  ( see  Definition  19)  is  the  solution  of  the 
equation 

H2{o„)  =  Tn.  (129) 


ctoM  = 


1  x  wl+x  2(log2+l)  1 

n3  Vl  -x2  °gl-x  7t3log2  V”  1  —  jc2 


ffiW  =rj- 


U0(t),  2x1  1  ,  : 

-^-^--7--==  +  ^— ==-log: 


•  X  n3  Vl  -x2  2jt3  Vl  -X2  1  -  x 


fT—S.  r  BuzMa, _ /2nlog2  4n  _ 1_ 

7-1  t-x  7i3log2  \4n2  —  1  (4n2  —  l)2  J  Vl  -  x2' 


lW-J 

u  ’  n3  y_,  t-x  (2n  + 


x  (2 n  +  1  )tc3  Vl  -x2 


for  all  n  >  1. 
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Proof.  The  identities  (135)— (137)  are  a  direct  consequence  of  the  identities  (27)  and  (28),  and  Lemmas  26 
and  27.  To  prove  (138),  we  first  observe  that  (see,  for  example  [2])  for  all  n  =  0, 1,2, . . 


U,(  1)=H  +  1, 

(139) 

C4(-1)  =  (-1)>  +  1). 

(140) 

It  immediately  follows  from  139  and  140  that 

Un(~  1)  £4-2  (-1)  n 

n  +  1  n  -  1 

(141) 

£4(1)  £4-2(1)  0 

n  -f  1  n  —  1 

(142) 

for  any  n  ^  2. 

Now,  we  observe  that  the  function 

w  «+  1  n  -  1 

(143) 

is  a  polynomial  of  degree  n,  and  that  the  points  x  =  ±1  are  the  roots  of  W  (see  (141)  and  (142)).  Therefore, 
there  exists  such  a  polynomial  p„~2  of  degree  n  -  2  that 

Un(X)  Un~l{x)  .  2\  /  \ 

n  +  i  n-  1 

(144) 

S.  Jiang,  V.  Rokhlin  I  Journal  of  Computational  Physics  191  (2003)  40-74 


59 


Finally,  we  obtain  (138)  by  substituting  (144)  into  (137).  □ 


5.5.  The  integral  equation  formulation  for  the  case  of  a  line  segment 

In  this  section,  we  will  combine  the  results  in  previous  four  sections  to  solve  the  Dirichlet  problem  for  the 
line  segment  1  =  [-1, 1]  on  the  real  axis.  The  following  lemma  is  a  direct  consequence  of  Theorems  13  and 
28,  and  Lemmas  23  and  29. 

Lemma  30.  For  any  function  f  e  C2[ —  1 , 1],  there  exists  a  unique  solution  a  &Fj  ( see  Definition  19)  to  the 
equation 

/I  10glr£  _  J0gi±£ 

,  —  t'-g(*)d* =/(*);  (145) 

in  other  words ,  the  operator  Bjx  is  well  defined  if  the  range  is  restricted  to  the  function  space  Fj.  Furthermore, 
if  f  is  orthogonal  to  To,  T\  with  respect  to  the  inner  product  (3),  then  the  function  Pi  (a)  can  be  continuously 
extended  to  R2. 

For  the  cases  f  —  T0,  f  =  Tu  we  have  the  following  lemma,  easily  verified  by  direct  calculation. 


Lemma  31. 

1.  The  only  bounded  continuous  solution  to  the  problem 

f  Aw  =  0  in  R2  \  7, 

\  u  =  1  on  I 

is 


u°,{x,y)  =  1. 


2.  The  only  bounded  continuous  solution  to  the  problem 


f  Au  =  0  in  R2  \  /, 
\  u  —  x  on  1 


is 


W ](x,y) 


N(x,y) 

D{x,y)  ’ 


with  the  functions  77,  D  : 


R  defined  by  the  formulae 


(146) 


(147) 


(148) 


(149) 


N(x,y)  =  fx+\f+y>  -  fix-1  f+f,  (150) 

D(x,y)  =  l^+j^  +  vV-  lf+72  +  !)2+y  +  v'V-  -4,  (151) 


respectively . 
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Combining  Lemmas  30  and  31,  we  immediately  obtain  the  following  theorem. 


Theorem  32.  Suppose  that  the  function  f  :  [-1,1]  -*  R  is  twice  continuously  differentiable.  Suppose  further 
that  the  function  o  £  7}  {see  Definition  19),  and  the  coefficients  A0,  A\  satisfy  the  following  equations: 

B,{a){x)  =  ti2  •  a{x)  +  J  ^  {  ^log  /(*  “  f)}  *  *(0*  =/(*)  ~Ao  ~At  (152) 

J  (f(x)-Ao-A,  ■  x)  ■  ~^J=  dx  =  0,  (153) 


J y(x)-A0-A]  ■x)--j==dx  =  0. 

Then  the  function  u  :  R2  -♦  R  defined  by  the  formula 

u{x,y)  =  P,{o)(x,y)  +A0  ■  u%x,y )  +  AX  ■  u){x,y) 

is  the  solution  of  the  problem 

f  Au  =  0  in  R2  \  7, 

\u-f  on  I. 


(154) 

(155) 

(156) 


Applying  Theorem  28,  we  can  now  solve  the  Dirichlet  problem  (156)  via  the  representation  (155). 


Corollary  33.  Under  the  conditions  of  Theorem  32,  the  solutions  to  the  Eqs.  (152)-(154)  are 

r'{/wW,,  Bo  .  BlX 

*  J 

k=2 


,(x)  =  ±-VT=7  frc>.  ['^A^it+-fL=+ 

W  n3  hi  J- 1  X~t 


vT 


(157) 


Ao  =  C, 


o, 


A\  =  Ci, 

where  the  coefficients  B0,  B\  are  defined  by  the  formulae 


Bn  = 


7t3  •  log  2 


fv  /2*log2 
V4F-1 


4* 


(4A2  -  1)" 


(158) 

(159) 

(160) 


_  2_  A  Ca+i 
rc3  ‘  h  2k  +  r 


respectively,  and  Ck  {k  =  0, 1,2, . . .)  are  /Ae  Chebyshev  coefficients  of  f  given  by  (9)  anrf  (10). 


(161) 


Remark  34.  It  immediately  follows  from  Lemma  29  that  the  function  P/(o)  with  a  given  by  (157)  has  an 
explicit  expression 

m(x,y)  =  /'  TT^r^ '  ^  (162) 

7-1  ((x-s)  +?) 

for  any  {x,y)  €  R2  \7,  with  the  function  D(o) :  [-1, 1]  -*•  R  defined  by  the  formula 
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Finally,  we  will  need  the  following  lemma. 

Lemma  35.  Suppose  that  the  operator  S  is  defined  by  the  formula 

S(*l)(x)  =  D(BJl(ij))(x)  =  J ^  log  |jc  —  /|  -Bj\ri)(t)dt,  (164) 

with  the  operator  Bj  defined  in  (96).  Then  S  is  a  bounded  linear  operator  from  C[—  1, 1]  to  C[—  1, 1]. 

Proof.  By  Lemma  29,  wc  have 

5(ro)W  =  -r^ 


s(r,)(x)  = 


and 


for  all  n  ^  2.  Substituting  (5)  into  (167),  we  obtain 

orr\/\-  1  /sin((w  +  1) arccos(jc))  sin((/i  -  1)  arccos(x))^ 

s(Tn){x)  2it‘V  n+l  B_1  )' 

for  all  n  ^  2.  Utilizing  the  trivial  fact  that  |  sin(w)|  <  1  for  any  real  number  u,  we  have 

for  all  n  =  0, 1,2, . . .  Now,  any  function  q>  €  C2[-l,  1]  can  be  expanded  into  a  Chebyshev  series 
<p(x)  =  f^Cn-T„(x), 

n=0 


and  by  Parseval’s  identity, 

rl  <P(*f 

_  ^ =  / . : 
n=0 


oc  p  1 

5c-=/, 


dx^n  •  || <plle 


i  s/\  -  x2 

Applying  Schwarz’s  inequality,  we  have 


IISMIL  <  £  Id  •  TOIL  « |  £  ~  IGI  <  ;  (£  ^ 


(165) 

(166) 

(167) 

(168) 

(169) 

(170) 

(171) 


l)2 


>/2  /  oo  \  */2 

|  ■  <2|ML. 


(172) 


Since  C2[-l,  1]  is  dense  in  C[— 1, 1],  S  is  bounded  from  C[— 1, 1]  to  C[— 1, 1].  □ 
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6.  Analytical  apparatus  III:  open  surface  problem  on  a  general  curve 

6.1.  The  integral  operator  Py 

In  this  section,  wc  consider  the  case  of  a  general  curve.  We  assume  that  y  :  [-1, 1]  — >  R2  is  a  sufficiently 
smooth  “open”  curve  with  the  parametrization  (71).  The  image  of  y  is  denoted  by  T.  We  will  consider  the 
operator  Py  :  Fi  C2(R2  \  T)  defined  by  the  formula 


Py(a){x)  =  J  ^  Ky(x,t)  ■  <r(t)dt 


_£  ['  ( 52Gy(j)(x)  8 GmW 

4  7_i  v  a;v(s)2  t[>  dN(s) 


log  | s  -  t\o{t)At )  ds, 


with  L  the  arc  length  of  y.  The  following  lemma  provides  an  explicit  expression  for  the  kernel  Ky.  Its  proof  is 
virtually  identical  to  that  of  Lemma  20. 

Lemma  36.  For  any  o€.Fi  ( see  Definition  19), 

(174> 

for  any  x  €  R2  \  T  and  t  €  (-1,1),  with  the  integral  in  (174)  intepreted  in  the  principal  value  sense. 

6.2.  The  boundary  integral  operator  By 

We  will  then  define  the  integral  operator  By :  Fj  —>  Lx{—  1, 1]  by  the  formula 

By{a) (t)  =  \\mPy(o)(y(t)  +  h  ■  N(t ))  =  lim  J ^  Ky(y(t)  +  h  •  N(t),s)  ■  o(s)<ls.  (175) 

The  following  lemma  is  a  direct  consequence  of  Lemmas  12  and  21;  it  provides  an  explicit  expression  for 


Lemma  37.  For  any  t  €  (-1,1), 


By(o)(t)  =  tc2  ■  <r(t)  +  J  K^,(t,s)  •  cr(s)dy, 


with  the  kernel  AT/” :  (—1,1)  x  (—1,1)  — >  [R  given  by  the  formula 


K*(t,s)  =  J^ 


MU  1 


' '  ’  '  J.x  0  T(x)  x-s~ ' 

with  the  integral  in  (177)  intepreted  in  the  principal  value  sense. 

6.3.  The  integral  equation  formulation  for  the  case  of  a  general  curve 


Similarly  to  the  operator  Bj  defined  in  (96),  the  kernel  K*  of  By  is  strongly  singular  at  the  end-points. 
Therefore,  if  the  solution  of  the  Dirichlet  problem  (69)  and  (70)  is  represented  by  the  function  Py(o)  on 
R2  \  f,  then  (70)  will  lead  to  a  boundary  integral  equation 
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By(cr)(t)=f(t),  (178) 

which  is  not  of  the  second  kind.  Because  of  the  obvious  similarity  of  the  operators  Bt,  By,  it  is  natural  to 
consider  the  operator  Py  :  C{- 1, 1]  — ►  C2(K2  \  T)  defined  by  the  formula 

py(n)(x)  =  pyoB7'(r,)(x).  (179) 

Obviously,  Py(t\)  is  not  defined  when  x  €  T,  and  we  will  define  the  operator  By  :  C[— 1, 1]  — >  C[-l,  1]  by 
the  formula 

By(r,)(t)  =  lim  ?,(«,)(*)  =  (180) 

AT-4V1/) 


The  following  theorem  is  one  of  principal  results  of  the  paper;  it  states  that  By  is  a  second  kind  integral 
operator  when  restricted  to  continuous  functions,  and  is  an  immediate  consequence  of  Lemmas  15  and  35. 


Theorem  38.  Suppose  that  y  :  [-1, 1]  —»  U2  is  a  sufficiently  smooth  “open”  curve  with  the  parametrization 
(71).  Suppose  further  that  the  operator  Ry :  C[-l,  1]  — ►  C[-l,  1]  is  defined  by  the  formula 


Ry{0\ 


)(x)  =  f_7(x,t) 


<r(f)  d/, 


(181) 


with  the  function  r  :  [-1, 1]  x  [-1, 1]  — >  R  defined  by  the  formula 


r{x,t) 


2(AT(0,r(*)-r(0)2 


+  • 


1 


llvW  -v(0ll4  llrW  -v(0ll 


+ 


l2-c(0  (N(t),y(x)-y(t)) 


llvW-KOII2  (*  -  02  ’ 


for  all  xy £  t,  and  by  the  formula 
L2-c(t)2 


r(t,t)=- 


48 


(182) 


(183) 


for  all  x  =  t,  with  L  the  arc  length  of  y,  and  c(t )  the  curvature  of  y  at  the  point  y{t).  Then , 


By(r,)(t)  =  (I  +  MM(t),  (184) 

with  I :  C[ —  1 , 1]  C[—  1, 1]  the  identity  operator,  andM  :  C[ — 1, 1]  — »  C{—  1, 1]  a  compact  operator  defined  by 

the  formula 

M(ri)(t)  =  (By  -  B,)  oBj\m  =  *yo5fa)(0,  (185) 

with  the  operators  Ry)S  :  C[-l,  1]  C[-l,  1]  defined  by  ((181),  (164)-(167)),  respectively. 


Remark  39.  It  immediately  follows  from  the  combination  of  (59)  and  (182)  that  the  operator  Ry  is  related  to 
Ry  defined  in  Lemma  15  by  the  formula 

^(5)W=^.»0(x+l)),  (186) 

with  5(0  =  +  1))>  an<l  the  function  7  is  related  to  the  function  r  defined  in  (59)  by  the  formula 


r(x,t)  =  —  ■  (x  +  1),- (t  +  1)^ 


(187) 
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The  function  Py(tj)  cannot,  in  general,  be  continuously  extended  to  the  whole  plane  IR2,  unless  the  density 
rj  satisfies  certain  additional  conditions.  The  following  lemma  is  a  direct  consequence  of  Theorems  13  and 
28,  and  Lemmas  23  and  29. 


Lemma  40.  Suppose  that  the  function  t\  €  C[-l,  1]  is  orthogonal  to  T0  and  T\  with  respect  to  the  inner  product 
(3).  Then  Py(r])  can  be  continuously  extended  to  IR2. 


Lemma  40  above  shows  that  the  solution  of  the  problem  (69)  and  (70)  cannot  be  represented  by  the 
function  Py{r\)  alone.  Indeed,  Py(t])(x)  decays  at  infinity  like  l/|x|,  whereas  Theorem  17  only  requires  that 
the  solution  of  the  problem  (69)  and  (70)  be  bounded  at  infinity.  Suppose  now  that  we  can  find  two 
functions  w",  ul  in  Sy  (see  (68))  such  that  the  following  condition  holds: 


det 


( {no,  T>) 

\  ini,  To) 


(lo,Ti)\ 

ini,Ti)J 


f=o, 


(188) 


with  j?0,  >h  the  solutions  to  the  equations 


=  «?(y(0). 


(189) 


5,(?i)(0  =  «|(y(0).  (19°) 

respectively,  and  the  inner  product  in  (188)  defined  by  (3).  Then  the  solution  of  the  problem  (69)  and  (70) 
can  be  represented  by  the  formula 

u(x)  =  Py(ri)(x)  +v4o  •  +At  ■  nj(x),  (191) 

so  that  the  density  tj,  while  satisfying  the  boundary  integral  equation 

By{n){t)  =f{t)  —Ao  ■  uy(y(t))  -A,  ■  Uy(y(t)),  (192) 

is  also  orthogonal  to  T0  and  T\.  The  following  lemma  provides  such  two  functions  indirectly;  it  describes  a 
single-layer-potential  representation  for  the  functions  Py{T„ )  (n  =  2, 3,  — ). 

Lemma  41.  Suppose  that  y  :  [-1, 1]  K2  is  a  sufficiently  smooth  “open”  curve  with  the  parametrization 
(71).  Then  for  any  n  —  2,3, , 

log^-vwi^*,  <193) 


for  any  x  &  T. 

Proof.  Combining  (179),  (173),  (164),  we  have  the  identity 


Py(l) M 


-?/:t 


62Gy(<)(x)  _  9<Jr())(x) 

0 N(t)2  U  dN(t) 


)  -S(n)m, 


for  an  arbitrary  rj  €  C[-l,  1].  In  particular, 
(&Gy(l){x) 


Py(Tn)(x)  ■ 


Z7 
:  4 


['  (&Gm(x)  8GyW(x)\ 

L  { dN(t)2  [  )  dN{t)  ) 


■S(Tn)(t)  dt. 


(194) 


(195) 
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Since  the  function  Gy(,)(x)  satisfies  the  Laplace  equation  for  all  x  y(i),  applying  (33)  to  Gy,)  and  car¬ 
rying  out  elementary  analytic  manipulations,  we  obtain  the  identity 


)(*)  -/.a 8gv(d(*A  WW 

\2  C(,)  I  ~  Qf2 


8A j{ty  '  j  m*) ) 

and  substitution  of  (196),  (167)  into  (195)  yields  the  identity 
W”)W  2  7t  J 3  f-  V  1,77  +  1  n-\  ) 


(196) 


(197) 


Now,  we  obtain  (193)  by  integrating  by  parts  twice  the  right-hand  side  of  (197).  □ 

The  following  lemma  is  an  immediate  consequence  of  Lemma  41  and  the  well-known  fact  that  the 
functions  «"  :  IR2  — ^  R  (n  =  0, 1,2, . . .)  defined  by  the  formulae 

u°y(x)  =  1,  (198) 


u"y(x)  =  J  log[x-y(f)|  --^Ldf,  77  =  1,2,...  (199) 

form  a  complete  basis  for  the  space  Sy  (see,  for  example  [15]). 

Lemma  42.  Suppose  that  y  :  [-1, 1]  — ►  U2  is  a  sufficiently  smooth  “open”  curve  with  the  parametrization 
(71).  Then  the  functions  m°,  wj  defined  by  (198)  and  (199)  satisfy  the  condition  (188)— (190). 

Finally,  we  summarize  our  analysis  for  the  case  of  a  general  curve  by  the  following  theorem. 

Theorem  43.  Suppose  that  y  :  [-1, 1]  — ►  IR2  is  a  sufficiently  smooth  “open”  curve  with  the  parametrization 
(71),  and  that  the  function  f  :  [ — 1, 1]  — ►  R  is  twice  continuously  differentiable .  Suppose  further  that  the 
function  rj :  [-1, 1]  — >  U,  and  the  coefficients  Ao,  A\  satisfy  the  equations 

By(ri)(t)  =  (I  +  Ry  oS)(r,)(t)  =f(t)  -A0  ■  u°y(y(t))  -  A,  ■  u\{y{t)),  (200) 


1 

VT^T2 


dt  =  0, 


(201) 


/_!’,(')'7r=7d,=0'  (202) 

with  I  the  identity  operator,  and  the  operators  Ry,S  :  C[—  1, 1]  — >  C[-l,  1]  defined  by  (181)  and  (164),  re¬ 
spectively.  Then  the  function  u  :  R2  — >  IR  defined  by  the  formula 

u(x)  =  Py(t]) (x)  +A0  ■  uy(x)+A,  ■  uy (x)  (203) 

is  the  solution  of  the  problem 

f  Au  =  0  777  IR2  \  T,  (204) 

\u=f  on  T, 

in  (203),  the  operator  Py :  C[— 1,1]  — >  C(R2)  is  defined  by  (179),  (173),  (145),  and  the  functions  uy,  uy  are 
defined  by  (198)  and  (199)  respectively. 
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Remark  44.  For  the  case  of  several  open  curves  r  =  7i>  the  following  modifications  should  be  made. 


Instead  of  (203),  the  function  u  will  be  given  by  the  formula 

«(*)  =  +  4  ■ «°  (*)  +4  ■  “!•,(*)}  +  C,  (205) 

with  C  a  real  number  to  be  determined,  and  the  functions  uj(  (i  =  1, . . .  ,m;  n  =  0, 1)  defined  by  the  formula 
<(*)  =  J't  log  |x  -  y#(0l '  (206) 

The  functions  and  the  coefficients  A'0,  A[,  C  are  determined  as  the  solution  of  the  system  of  equations 

BMW  =  ('  +  4,  °  S)(ti,m  =  /<(')  -  E  Uo  •  <(%«)  -  A  ■  <(V,'W)}  -  c,  (207) 

j= i 

J-fM'7T^?dt  =  0,  {208) 

/_i^-7f=Fd/  =  0’  (209) 

E4  =  0.  (210) 

1=1 


Clearly,  the  functions  u ®  defined  by  (206)  are  linearly  independent;  the  constraint  (210)  and  the  constant 
term  C  are  introduced  so  that  the  function  u  is  bounded  at  infinity. 


7.  Numerical  algorithm 

In  this  section,  we  construct  a  rudimentary  numerical  algorithm  for  the  solution  of  the  Dirichlet  problem 
(69)  and  (70)  via  the  Eqs.  (200H202).  Since  the  construction  of  the  matrix  and  the  solver  of  the  resulting 
linear  system  are  direct,  the  algorithm  requires  O (N3)  work  and  0(A^2)  storage,  with  N  the  number  of  nodes 
on  the  boundary.  While  standard  acceleration  techniques  (such  as  the  Fast  Multipole  Method,  etc.)  could 
be  used  to  improve  these  estimates,  no  such  acceleration  was  performed,  since  the  purpose  of  this  section 
(as  well  as  the  following  one)  is  to  demonstrate  the  stability  of  the  integral  formulation  and  the  convergence 
rate  of  a  very  simple  discretization  scheme. 

By  Theorem  43,  the  equations  to  be  solved  are  (200H202),  where  the  unknowns  are  the  function  rj,  and 
two  real  numbers  A0,  A\.  To  solve  (200)-(202)  numerically,  we  discretize  the  boundary  into  N  Chebyshev 
nodes  and  approximate  the  unknown  density  t]  by  a  finite  Chebyshev  series  of  the  first  kind, 


(2ii) 

*= o 

with  the  coefficients  Ck  (k  =  0, ...  ,N  -  1)  to  be  determined.  In  order  to  discretize  (200),  we  start  with 
observing  that  by  (165)— (167),  the  action  of  the  operator  S  on  the  function  r]  is  described  via  the  formula 
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N-l  (N- \  \  0 

EEvd' 

*=o  \j=o  J 


Uk(x)  ■  VI  -x2, 


where  the  matrix  B  =  ( Bkj )  {kj  =  0, . . .  ,N  -  1)  is  given  by  the  formulae 


(212) 


Boo  — 

Bkk  —  —  31 1 
BkM 2  —  4k  > 

BkJ  —  0, 


iaa-i, 

O^k^N  —  3, 
otherwise. 


(213) 


In  other  words,  given  a  function  r]  expressed  as  a  Chebyshev  series  of  the  first  kind,  (212)  expresses  S(rj) 
as  a  Chebyshev  scries  of  the  second  kind.  Now,  it  is  natural  to  approximate  the  operator  Ry  by  an  ex¬ 
pression  converting  functions  of  the  form 


N- 1 

•*/*(') 


A=0 


(214) 


into  functions  of  the  form 

X>-TO,  (215) 

k= 0 

with  the  product  Ry  o  S  converting  expressions  of  the  form  (215)  into  expressions  of  the  same  form.  Thus, 
we  approximate  the  kernel  r(x,t)  (see  (182))  of  the  operator  Ry  with  an  expression  of  the  form 

7(x,t)  ^  ■  Ux)  ■  Uj(t).  (216) 

x=0  J~  0 

Clearly,  the  coefficients  Ky  have  to  be  determined  numerically,  since  the  curve  T  is  user-specified,  and  is 
unlikely  to  have  a  convenient  analytical  expression.  Thus,  we  obtain  the  coefficients  Ki}  by  first  constructing 
the  N  x  N  matrix  R  =  (r(x„  tj))  ( ij  =  0, 1, . . . ,  N  -  1)  with  x,(i  =  0, 1, . . . ,  N  -  1)  the  Chebyshev  nodes 
defined  by  (4)  and  tj{J  =  0, . . . ,  N  -  1)  the  Chebyshev  nodes  of  the  second  kind  defined  by  (7),  then  con¬ 
verting  R  into  the  matrix  K  =  {Kij){i,j  =  0, 1, ...  ,N  -  1)  by  the  formula 


K=URV , 

(217) 

with  N  x  N  matrices  U  =  (Uy),  V  —  (Vy)  defined  by  the  formulae 

/  Uoj  =  £ •  To{xj),  j  =  0,l,...,N  —  1, 

\  Uy  =  l-T,(xj),  i  =  1, . . .  ,N  —  1,  j  =  —  l, 

(218) 

v‘>-irrv sin'  (iNN +i  *)  •  u>m,  u = o.  i, . . .  ,n  i. 

(219) 

respectively.  We  then  approximate  the  prescribed  Dirichlet  data  /  by  its  Chebyshev  approximation  of  order 
N-l 


(220) 
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where  the  coefficients  fk  can  be  obtained  by  first  evaluating  /  at  Chebyshev  nodes  xh  then  applying  to  it  the 
matrix  U  defined  by  (218),  i.e., 


N—\ 


fk  —  ^ki 

i=0 

Similarly,  we  approximate  the  function  «’  (see  (199))  with  an  expression  of  the  form 


Jfc-0 


with  the  coefficients  uk  defined  by  the  formula 

A'-l 


Uk  =  Uki '  “y(y(x'))« 


(221) 


(222) 


(223) 


(=0 


with  x,  the  Chebyshev  nodes  defined  by  (4).  Combining  (212),  (216),  (221),  and  (222),  we  discretize  (200) 
into  the  equation 


(224) 


A ■ 

_ 

+  Aq  ■ 

n\ 

° 

:  +** 

So  \ 

«i 

. 

A 

A 

\  CV-i  / 

Vo/ 

\um-i) 

\A- 

with  N  x  N  matrix  A  defined  by  the  formula 
A  =  In  +  K  •  B, 

with  In  the  N  x  N  identity  matrix.  Furthermore,  (201)  and  (202)  lead  to  the  equations 

Co  =  0, 

Ci  =0. 


(225) 

(226) 
(227) 


Finally,  combining  (224),  (226),  (227),  we  obtain  the  following  Unear  system  of  dimension  N  +  2  to  be 
solved 


(228) 


/I  00. ..0  0  0  \ 

Co  \ 

0  \ 

0  1  0...0  0  0 

Ci 

0 

1  «o 

; 

f8 

A  0  i/i 

Cv-i 

— 

A 

•  • 

A0 

\ 

0  UN- 1 ) 

Ai  y 

\/at-1  / 

Remark  45.  Having  solved  (228)  with  any  standard  solver  (we  used  DGECO  from  UNPACK),  we  can 
compute  the  solution  of  the  Problem  (69)  and  (70)  at  any  point  in  R2  via  (203). 

Remark  46.  The  algorithm  can  be  generalized  to  the  case  when  the  boundary  consists  of  several  disjoint 
open  curves,  and  the  generalization  is  straightforward  (see  Remark  44). 
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8.  Numerical  examples 


A  FORTRAN  code  has  been  written  implementing  the  algorithm  described  in  the  preceding  section.  In 
this  section,  wc  demonstrate  the  performance  of  the  scheme  with  several  numerical  examples.  We  consider 
the  problem  in  electrostatics:  the  boundary  is  made  of  conductor  and  grounded,  the  electric  field  incident 
on  the  boundary  is  generated  by  the  sources  outside  the  boundary;  that  is  to  say,  there  are  three  fields 
present:  the  incident  field  ul9  the  reflected  field  wr,  and  the  total  field  ux  =  u\  +  «r,  where  ut  —  0  on  the 
boundary,  and  ur  —  -U[  on  the  boundary  and  is  harmonic  elsewhere.  For  these  examples,  we  plot  the 
equipotential  lines  of  the  total  field  and  present  tables  showing  the  convergence  rate  of  the  algorithm  by 
computing  the  errors  of  the  reflected  field. 

Remark  47.  In  the  examples  below,  the  problems  to  be  solved  via  the  procedure  of  the  preceding  section 
have  no  simple  analytical  solution.  Thus,  we  tested  the  accuracy  of  our  procedure  by  evaluating  our  so¬ 
lution  via  the  formula  (203)  at  a  large  number  M  of  nodes  on  the  boundary  f  (in  our  experiments,  we 
always  used  M  =  4000),  and  comparing  it  with  the  analytically  evaluated  right-hand  side.  We  did  not  need 
to  verify  the  fact  that  our  solutions  satisfy  the  Laplace  equation,  since  this  follows  directly  from  the  rep¬ 
resentation  (203). 


In  each  of  those  tables,  the  first  column  contains  the  total  number  N  of  nodes  in  the  discretization  of 
each  curve.  The  second  column  contains  the  condition  number  of  the  linear  system.  The  third  column 
contains  the  relative  L1  error  of  the  numerical  solution  as  compared  with  the  analytically  evaluated  Di- 
richlet  data  on  the  boundary.  The  fourth  column  contains  the  maximum  absolute  error  on  the  boundary.  In 
the  last  two  columns,  we  list  the  errors  of  the  numerical  solution  as  compared  with  the  numerical  solution 
with  twice  the  number  of  nodes,  where  the  solution  is  evaluated  at  4000  equispaced  points  on  a  circle  of 
radius  1.4  centered  at  the  origin;  the  fifth  column  contains  the  relative  1?  error,  and  the  sixth  column 
contains  the  maximum  absolute  error. 


Example  1.  In  this  example,  the  boundary  is  the  fine  segment  parametrized  by  the  formula 


UW  =  -0.2, 


-1</<1. 


(229) 


The  Dirichlet  data  are  generated  by  a  unit  charge  at  (0,0).  The  numerical  results  are  shown  in  Table  1. 
The  source,  curve  and  equipotential  lines  are  plotted  in  Fig.  1 . 


Example  2.  In  this  example,  the  boundary  is  an  elliptic  arc  parametrized  by  the  formula 


f  x(f)  =  0.8cos(/), 

\y(t)  =  0.5  sin(f)  +  0.25, 


-TtsSr^O. 


(230) 


Table  1 

Numerical  results  for  Example  1 


N 

K 

£2(0 

£“(0 

E\u) 

£”“(«) 

4 

0.5242?  +  01 

0.288E  +  00 

0.607E  +  00 

0.5132?  —  01 

0.5902?  —  01 

8 

0.450£  +  01 

0.7032?  -  01 

0.178E  +  00 

0.6132?  —  02 

0.6862?  —  02 

16 

0.388£  +  01 

0.759E-02 

0.212E-01 

0.1332?  —  03 

0.1462?  —  03 

32 

0.344£  +  01 

0.1652?  -  03 

0.4862?  —  03 

0.115E-06 

0.1262?  —  06 

64 

0.318£  +  01 

0.1472?-  06 

0.4462?  —  06 

0.1462?  —  12 

0.1642?  —  12 

128 

0.303£  +  01 

0.252E  -  12 

0.839E-12 

0.2502?  —  13 

0.2652?  -  13 

N 

K 

£J(r) 

£°°(r) 

E2(u) 

Ex(u) 

4 

0.5132s  +  01 

o.isos  +  oo 

0.1242s  4-  00 

0.3432s  —  01 

0.1 66E  -  01 

8 

0.4612s  4*  01 

0.722 E  -  01 

0. 5542s  “01 

0.6682s  —  02 

0.333£  -  02 

16 

0. 3992s  +  01 

0.1032s  —  01 

0.8332s  —  02 

0.1552s -03 

0.7732s  -  04 

32 

0.352£  +  01 

0.230£-03 

0. 1 87£  —  03 

p 

oo 

O 

-4 

0.4262?  —  07 

64 

0.316£  +  01 

0.128£-06 

0.1052s  —  06 

0.4752s  —  13 

0.201E-  13 

128 

0.301£  +  01 

0.141£-  12 

0.1342? -12 

0.2722s  —  13 

0.1022s  —  13 

The  Dirichlet  data  are  generated  by  one  positive  charge  of  unit  strength  at  (0,0)  and  another  negative 
charge  of  unit  strength  at  (0,-0.5).  The  numerical  results  are  shown  in  Table  2.  The  sources,  curve,  and 
equipotential  lines  are  plotted  in  Fig.  2. 


Example  3.  In  this  example,  the  boundary  is  a  spiral  parametrized  by  the  formula 


r  x(f)  _  t  cos(3 .30  0.1,  0  2^  1.2 

\y(0  =  tsin(3.3t), 


(231) 


The  Dirichlet  data  are  generated  by  a  unit  charge  at  (0,0).  The  numerical  results  are  shown  in  Table  3. 
The  source,  curve,  and  equipotential  lines  are  plotted  in  Fig.  3. 


Example  4.  In  this  example,  we  consider  the  case  of  several  open  curves.  The  boundary  consists  of  three 
elliptic  arcs  parametrized  by  the  formulae 

/*,(*)  =  U  cos(t)-l,  _JL<t<* 

\  yi  (0  =  sin(0  +  0.5,  12"  "4’ 


(232) 


N 

K 

e\d 

£»(r) 

E2{u) 

£“(«) 

8 

0.325£  +  02 

0.215£-01 

0.323£  —  01 

0.478£  4-  00 

0.426£  +  00 

16 

0.579£  +  01 

0.549£  —  03 

0.986£-03 

0.658£  -  01 

0.820£  —  01 

32 

0.478£  +  01 

0.21  I£-  05 

0.31 7£  -  05 

0.1 49£  —  02 

0.1 94£  —  02 

64 

0.424ZT  +  01 

0.987£  -  11 

0.122£  -  10 

0.350£  -  06 

0.453£  —  06 

128 

0.3925  +  01 

0.861£  —  13 

0.520£  —  12 

0.127£T  —  12 

0. 1 1 9£  —  12 

256 

0.374£  +  01 

0.138£  —  12 

0.139£  -  11 

0.139£-12 

0.123£  —  12 

i 

\  x2(t)  =  l.lcos(t), 
[^(r)  =  sin(0-  1.2, 

7ji _ 1  In 

12  12  ’ 

(233) 

f  X3  (t)  =  1.1  cos(r)  +  1, 
[  yilt)  =  sin(r)  +  0.5, 

hs* 

/A 

/A 

5n 

_  12  ’ 

(234) 

The  Dirichlet  data  are  generated  by  a  unit  charges  at  (0,0).  The  numerical  results  are  shown  in  Table  4, 
where  N  is  the  number  of  nodes  on  each  curve.  The  source,  curves,  and  equipotential  lines  are  plotted  in 
Fig.  4. 


Remark  48.  The  above  examples  illustrate  the  superalgebraic  convergence  of  the  scheme  for  smooth  data 
and  curves  (see  Remark  16  in  Section  2.6).  The  number  of  nodes  needed  depends  on  the  complexity  of  the 
underlying  geometry  and  the  smoothness  of  the  prescribed  data.  The  condition  number  of  the  resulting 
linear  system  is  usually  very  low. 
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Fig.  3.  Source,  curve,  and  equipotential  lines  for  Example  3. 


Table  4 

Numerical  results  for  Example  4 


N 

K 

E\r) 

E°°(r) 

E2(u) 

£°°(u) 

4 

0.8455  +  01 

0.1135-01 

0.2285  -  01 

0.4935-03 

0.1175-02 

8 

0.7545  +  01 

0.1265-03 

0.2695  -  03 

0.1595-05 

0.1085-04 

16 

0.6895  +  01 

0.1735-07 

0.3905-07 

0.6565-  10 

0.4525-09 

32 

0.6495  +  01 

0.4435-  12 

0.1965-11 

0.9505-  13 

0.1135-12 

64 

0.6275  +  01 

0.6585-13 

0.2955-  12 

0.4925-14 

0.4335-14 

128 

0.6155  +  01 

0.8805-  13 

0.3565-  12 

0.9685-14 

0.9715  -  14 

9.  Conclusions  and  generalizations 

We  have  presented  a  stable  second  kind  integral  equation  formulation  for  the  Dirichlet  problem  for  the 
Laplace  equation  in  two  dimensions,  with  the  boundary  condition  specified  on  a  curve  (consisting  of  one  or 
more  separate  segments).  The  resulting  numerical  algorithm  converges  superalgebraically  if  both  the 
boundary  data  and  the  curves  are  smooth.  Obviously,  the  combination  of  the  Fast  Multipole  Method  (see, 
for  example,  [7])  and  any  standard  iterative  solver  yields  an  0(N)  algorithm,  with  N  the  number  of  nodes 
on  the  boundary. 

The  extensions  of  the  scheme  of  this  paper  to  other  boundary  conditions  (such  as  Neumann  condition, 
Robin  condition,  etc.)  specified  on  an  open  curve  T  in  U2  are  fairly  straightforward.  For  the  Neumann 
problem,  representing  the  solution  in  the  form  of  a  double  layer  potential,  one  obtains  a  hypersingular 
integral  equation  on  T.  Its  subsequent  preconditioning  by  a  single  layer  potential  yields  a  second  kind 
integral  equation  (SKIE).  For  a  Robin  problem,  one  obtains  an  SKIE  formulation  by  representing  the 
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Fig.  4.  Source,  curves,  and  equipotential  lines  for  Example  4, 


solution  via  an  appropriate  linear  combination  of  single  and  double  layer  potentials,  with  a  further  pre¬ 
conditioning  by  a  single  layer  potential.  Furthermore,  the  approach  of  this  paper  can  be  applied  almost 
without  modification  to  elliptic  PDEs  other  than  the  Laplace  equation  (such  as  Helmholtz  equation, 
Yukawa  equaiton,  etc.).  Indeed,  the  Green’s  function  for  any  such  equation  has  the  form 

G{x,y)  =  <t>(x,y)  •  log(||x-y||)  +  \]/{x,y),  (235) 

with  <p,  0  a  pair  of  smooth  functions,  and  0(0, 0)  =  1/(2tt)  (see,  for  example  [4]).  When  the  procedure  of 
Section  6  of  this  paper  is  applied  to  a  Green’s  function  of  the  form  (235)Jhe  result  is  virtually  identical  to  that 
obtained  in  Section  6.3,  except  for  the  change  in  the  compact  operator  Py  in  (179).  However,  the  convergence 
rate  of  the  numerical  scheme  of  Section  7  deteriorates  drastically,  since  in  this  case  the  kernel  K  of  the  op¬ 
erator  Py  in  (179)  is  logrithmically  singular  (while  for  the  Laplace  equation,  it  is  smooth).  High-order  dis¬ 
cretization  schemes  for  such  integral  equations  can  be  found  in  the  literature  (see,  for  example  [1,12,20]). 

Needless  to  say,  three-dimensional  versions  of  most  problems  of  mathematical  physics  are  of  more 
immediate  applied  interest  than  their  two-dimensional  versions.  Thus,  the  results  of  this  paper  should  be 
viewed  as  a  model  for  the  investigation  of  the  Dirichlet  problem  for  the  Laplace  equation  (or  some  other 
elliptic  PDE)  in  three  dimensions,  with  the  data  specified  on  an  open  surface  S.  When  the  boundary  S  is 
smooth,  the  transition  is  fairly  straightforward;  it  becomes  more  involved  when  S  itself  has  comers.  Both 
cases  are  presently  under  investigation. 
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Abstract 

A  second  kind  integral  equation  formulation  is  presented  for  the  Dirichlet  problem  for  the  Laplace  equation  in  two 
dimensions,  with  the  boundary  conditions  specified  on  a  collection  of  open  curves.  The  performance  of  the  obtained 
apparatus  is  illustrated  with  several  numerical  examples.  The  formulation  is  a  simplification  of  the  equation  previously 
constructed  by  the  authors. 

©  2003  Elsevier  Inc.  All  rights  reserved. 

Keywords:  Open  surface  problems;  Laplace  equation;  Second  kind  integral  equation;  Dirichlet  problem 
AMS:  65R10;  77C05 


1.  Introduction 

Integral  equations  have  been  one  of  principal  tools  for  the  numerical  solution  of  scattering  problems 
for  more  than  30  years,  both  in  the  Helmholtz  and  Maxwell  environments.  Historically,  most  of  the 
equations  used  have  been  of  the  first  kind,  since  numerical  instabilities  associated  with  such  equations 
have  not  been  critically  important  for  the  relatively  small-scale  problems  that  could  be  handled  at  the 
time. 

The  combination  of  improved  hardware  with  the  recent  progress  in  the  design  of  “fast”  algorithms  has 
changed  the  situation  dramatically.  Condition  numbers  of  systems  of  linear  algebraic  equations  resulting 
from  the  discretization  of  integral  equations  of  potential  theory  have  become  critical,  and  the  simplest  way 
to  limit  such  condition  numbers  is  by  starting  with  second  kind  integral  equations.  Hence,  increasing  in¬ 
terest  in  reducing  scattering  problems  to  systems  of  second  kind  integral  equations  on  the  boundaries  of  the 
scatterers. 
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During  the  last  several  years,  satisfactory  integral  equation  formulations  have  been  constructed  in  both 
acoustic  (Helmholtz  equation)  and  electromagnetic  (Maxwell’s  equations)  environments,  whenever  all  of 
the  scattering  surfaces  are  “closed”  (i.e.,  scatterers  have  well-defined  interiors,  and  have  no  infinitely  thin 
parts).  In  this  paper,  we  describe  a  second  kind  integral  equation  formulation  for  the  Dirichlet  problem  for 
the  Laplace  equation  with  boundary  data  specified  on  a  collection  of  “open”  curves.  We  start  with  con¬ 
structing  the  right  inverse  of  the  single  layer  potential  operator  on  a  line  segment  via  simple  analytic  means; 
then  we  apply  such  operator  as  a  preconditioner  for  the  single  layer  potential  operator  on  the  curve 
considered  to  obtain  a  second  kind  integral  operator. 

Remark  L  In  a  recent  paper  [7],  the  authors  construct  a  somewhat  different  procedure  for  the  solution  of 
problems  of  the  classical  potential  theory  with  data  specified  on  a  collection  of  open  surfaces.  While  the 
approach  of  the  present  paper  is  very  similar  to  that  of  [7],  in  [7],  the  single  layer  potential  is  used  to 
precondition  the  quadruple  layer  potential  from  the  right;  here,  the  quadruple  layer  potential  is  used  to 
precondition  the  single  layer  potential  from  the  right.  For  technical  reasons,  the  latter  leads  to  a  drastically 
simplified  numerical  procedure  (and  also,  requires  simpler  analysis);  hence,  this  sequel  to  [7], 

Remark  2.  As  observed  by  one  of  referees  to  this  paper,  a  second  kind  integral  equation  is  constructed  in 
[11]  (Chapter  16)  in  the  Laplace  environment.  In  [11],  the  solution  of  the  Dirichlet  problem  is  represented 
via  the  real  part  of  the  Cauchy’s  integral  and  the  resulting  boundary  equation  is  a  singular  integral 
equation.  A  second  kind  integral  equation  is  then  obtained  by  applying  the  inverse  operator  of  the  Cauchy’s 
integral  operator  from  the  left  to  both  sides  of  the  equation.  However,  the  scheme  of  [11]  cannot  be  easily 
extended  either  to  three  dimensions  or  to  the  Helmholtz  equation  in  two  dimensions,  since  it  relies  heavily 
on  the  harmonic  property  of  the  solution  and  the  techniques  of  complex  analysis. 

Remark  3.  As  observed  by  another  of  referees  to  this  paper,  a  closed  surface  enclosing  very  thin  volumes 
presents  difficulties  closely  related  to  those  associated  with  open  surfaces.  This  class  of  issues  is  not  treated 
in  this  paper. 

The  layout  of  the  paper  is  as  follows.  Section  2  contains  an  informal  description  of  the  procedure.  In 
Section  3,  the  necessary  mathematical  and  numerical  preliminaries  are  introduced.  In  Sections  4,  we  present 
the  principal  analytic  result  of  the  paper.  In  Section  5,  we  describe  a  simple  numerical  implementation  of 
the  scheme.  The  performance  of  the  algorithm  is  illustrated  in  Section  6  with  several  numerical  examples. 
Finally,  in  Section  7  we  discuss  several  generalizations  of  the  approach. 


2.  Informal  description  of  the  procedure 

In  this  section,  we  present  an  informal  description  of  the  procedure.  We  assume  that  y  :  [—  1, 1]  — ►  U2  is  a 
sufficiently  smooth  “open”  (i.e.,  y(-l)  ^  y(l))  curve  with  the  parametrization 

rt*)  =  y(§'(*  +  1))>  (J) 

where  L  is  the  total  arc  length  of  the  curve,  and  y  :  [0,1]  — >  R2  is  the  same  curve  parametrized  by  its  arc 
length.  The  image  of  y  will  be  denoted  by  T.  We  consider  the  Dirichlet  problem  for  the  Laplace  equation  in 
two  dimensions,  with  the  boundary  conditions  specified  on  F,  i.e., 

f  Am  =  0  in  R2  \  T, 

\u  =  f  on  r. 


(2) 
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This  problem  has  a  unique  bounded  solution  if  the  Dirichlet  data  /  is  sufficiently  smooth  (see,  for  example, 
[9,  p.  121]).  The  purpose  of  this  paper  is  to  reduce  the  problem  (2)  to  a  second  kind  integral  equation  on  F. 

The  tools  of  the  classical  potential  theory  by  themselves  do  not  lead  to  such  an  integral  equation.  Indeed,  the 
standard  prescription  (see,  for  example,  [9])  is  to  represent  the  solution  of  a  Dirichlet  problem  by  a  double  layer 
potential,  and  the  solution  of  the  Neumann  problem  by  a  single  layer  potential.  In  either  case,  the  behavior  of 
the  singularity  near  the  boundary  is  such  that  an  integral  equation  of  the  second  kind  on  T  is  obtained. 

However,  the  classical  procedure  critically  depends  on  F  being  a  closed  curve.  Indeed,  the  potential  of  a 
double  layer  on  the  curve  T  experiences  a  jump  when  F  is  crossed;  the  magnitude  of  the  jump  is  equal  to  the 
density  of  the  double  layer  at  the  crossing  point.  This  poses  no  problem  when  the  curve  is  a  closed  one,  since 
the  potential  is  to  be  represented  on  only  one  (inner  or  outer)  side  of  the  curve.  For  an  open  curve,  the 
potential  has  to  be  represented  on  both  sides  of  the  curve;  and  in  most  cases,  the  right-hand  side  /  (viewed 
as  the  limiting  value  of  the  solution  from  both  sides)  has  no  jump  across  F.  Thus,  an  attempt  to  represent 
the  solution  of  (2)  via  a  double  layer  potential  results  in  a  dipole  density  that  is  identically  equal  to  zero. 

One  could  attempt  to  represent  the  solution  of  (2)  by  a  charge  distribution  on  F.  The  resulting  potential 
is  continuous  across  F,  and  algorithms  of  this  type  have  been  constructed  and  used  numerically  (see,  for 
example,  [6]).  However,  the  resulting  integral  equation  is  of  the  first  kind  (though,  fortunately,  with  a 
logarithmically  singular  kernel),  with  all  the  usual  numerical  disadvantages.  Another  option  is  to  use  the 
quadruple  layer  potential  of  the  form 

R(ff)(x)=  [  — ^-I(log||x-}'(/)||)  -ff(0d/,  (3) 

J- 1  oN(t) 

with  N(t)  the  unit  normal  to  T  at  y(f);  the  resulting  equation  is  not  an  integral  equation  at  all,  containing  a 
part  that  is  actually  a  distribution.  In  engineering  literature,  such  objects  are  known  as  “hypersingular 
integral  equation”.  Satisfactory  procedures  have  been  constructed  for  their  numerical  solution  (see,  for 
example,  [3,10,12]);  however,  these  are  not  as  simple  or  as  stable  as  the  many  methods  available  for  the 
solution  of  second  kind  integral  equations. 

This  paper  is  based  on  the  observation  that  when  the  curve  is  the  line  segment  I  =  [-1, 1],  the  right 
inverse  of  the  single  layer  potential  operator  (denoted  by  Sfl)  can  be  constructed  by  simple  analytic  means, 
where  the  single  layer  potential  operator  Sj :  L![— 1, 1]  —>  C[— 1, 1]  is  defined  by  the  formula 

S,(o)(x)  =  J  log  \x-t\-  o(t)  dt.  (4) 

Furthermore,  if  S,-1  is  used  as  a  preconditioner  for  the  single  layer  potential  operator  Sy :  V  [-1, 1]  — ►  C(R2) 
on  T  defined  by  the  formula 

sy(o)  (z)  =  /tog  I,-  y(0l  •  o-(0d'.  (5) 

i.e.,  the  solution  of  the  problem  (2)  is  represented  in  the  form 

«(*)=  SyOSj-'  fo)(x),  (6) 

then  the  resulting  boundary  integral  equation  is  of  the  second  kind. 

Remark  4.  A  stable  second  kind  integral  equation  formulation  has  also  been  developed  for  the  problem  (2) 
in  [7].  Two  key  observations  used  in  [7]  are:  first,  the  product  of  the  quadruple  layer  potential  operator  and 
the  single  layer  potential  operator  is  a  second  kind  integral  operator  for  the  case  of  a  closed  curve;  second, 
the  case  of  a  line  segment  can  be  solved  analytically.  The  integral  representation  for  the  solution  of  the 
problem  (2)  in  [7]  is  of  the  form 
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u(x)  =  Qy0S'0(Q,0S,)-1  (!»)(*),  (7) 

where  Qy  is  the  sum  of  a  quadruple  layer  potential  and  a  weighted  double  layer  potential  with  the  weight 

equal  to  the  curvature,  5/  is  the  single  layer  potential  operator  for  the  line  segment  7=  [-1,1],  and 

(Q,OS,)~'  is  (in  the  appropriate  sense)  the  right  inverse  of  Q,OS,.  The  approach  of  this  paper  differs  from 
that  of  [7]  in  that  the  roles  of  Q  and  S  are  interchanged,  leading  to  a  simpler  scheme.  Indeed,  straight¬ 
forward  analysis  shows  that  the  representation  (6)  is  equivalent  to 

u(x)  =  Sy0Q,0(S,0Q,)-'(ri)(x).  (8) 

In  other  words,  the  solution  of  (2)  is  represented  by  a  single  layer  potential  on  f  preconditioned  by  the 
quadruple  layer  potential  for  the  line  segment  7,  with  a  further  preconditioning  by  the  right  inverse  of 
SjOQi  to  eliminate  the  singularities  at  the  end  points. 


3.  Analytical  preliminaries 

In  this  section,  we  summarize  several  results  from  classical  and  numerical  analysis  to  be  used  in  the 
remainder  of  the  paper.  Detailed  references  are  given  in  the  text. 


3.1.  Chebyshev  polynomials  and  Chebyshev  approximation 

Chebyshev  polynomials  are  frequently  encountered  in  numerical  analysis.  As  is  well  known,  Chebyshev 
polynomials  of  the  first  kind  Tn :  [-1, 1]  -►  R  (n  >  0)  are  defined  by  the  formula 

T„{ x)  =  cos(narccos(x)), 

and  are  orthogonal  with  respect  to  the  inner  product 

1 


if,g)  =  y /(*)•*(*)•’ 


:  dX- 


!  (n  >  0)  are  defined  by  the  formula 


Vl  -x2 

Chebyshev  polynomials  of  the  second  kind  t/„ :  [—1,1] 

sin((w  +  l)arccos(x)) 
sin(arccos(x))  ’ 

and  are  orthogonal  with  respect  to  the  inner  product 
(f,g)  =  y  /(*) -g{x)-Vl  -x2dx. 

The  Chebyshev  nodes  x,  of  degree  N  are  the  zeros  of  TN  defined  by  the  formula 
(2/ +1)71 


Xj  =  cos- 


IN 


/  =  0, 1 , TV  —  1 . 


(9) 

(10) 


(11) 


(12) 


(13) 


For  a  sufficiently  smooth  function  / :  [-1, 1]  -»  R,  its  Chebyshev  expansion  is  defined  by  the  formula 
f(x)  =  '£ck-Tk(x), 


k= 0 


with  the  coefficients  Ck  given  by  the  formulae 


(14) 
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^5 

—  i 

II 

-1 

■  T0(x) 

■  (1  -  x 2)  1//2dx, 

(15) 

Q  _  ^ 

fm 

Ux)  ■ 

N 

7 

«N 

H 

\ 

(16) 

for  all  k  ^  1 .  We  will  also  denote  by  Pj  the  order  N  -  1  Chebyshev  approximation  to  the  function  /  on  the 
interval  [-1,1],  i.e.,  the  (unique)  polynomial  of  order  N  -  1  such  that  Pf{xi)  =  /(*,)  for  all  i  —  0, 1, . . . , 
N  -  1,  with  x,  the  Chebyshev  nodes  defined  by  (13). 

The  following  lemma  provides  an  error  estimate  for  the  Chebyshev  approximation  (see,  for  example,  [4]). 

Lemma  5.  If  f  £  0* [—1,1]  (i.e.,  f  has  k  continuous  derivatives  on  the  interval  [—1,1]),  then  for  any 

xe  [-1,1], 

|//W-/W|  =  o(jjj).  (17) 

In  particular,  if  f  is  infinitely  differentiable,  then  the  Chebyshev  approximation  converges  superalgebraically 
(i.e.,  faster  than  any  finite  power  of  1/N  as  N  — *  oo). 

3.2.  Miscellaneous  results 


In  this  section,  we  collect  several  results  from  classical  analysis  to  be  used  subsequently.  Lemma  6  lists 
two  standard  definite  integrals;  both  can  be  found  (in  a  somewhat  different  form)  in  [5].  Lemma  7  states  a 
standard  fact  from  classical  potential  theory;  it  can  be  found  in  [9].  Finally,  Lemma  8  states  that  if  the  curve 
y  is  sufficiently  smooth,  then  the  restriction  of  the  kernel  of  the  operator  Sy  —  Sj  on  T  is  also  smooth  (see  (4), 
(5)  for  the  definitions  of  Sj  and  Sy). 


Lemma  6.  For  any  x  €  (-1, 1), 
rx  1 

J  ^  log  \x  —  *|  •  -^===  dr  =  -7t  •  log  2, 

and 

p.v.  /  — -  ■  f/„_,(0  •  VT-?dt  =  *  •  Tn(x), 
J- 1  x  ”  1 


for  any  w  >  1 . 


(18) 


(19) 


Lemma  7.  Suppose  that  y  :  [-1, 1]  — ►  R2  is  a  sufficiently  smooth  open  regular  curve  with  the  parametrization 
(1),  and  that  the  function  a  e  1]  satisfies  the  condition 


Then  the  function  u  :  M2  M  defined  by  the  formula 
u(x)  =  f '  log|x-y(/)|  •  cr(t)dt 
is  bounded  in  IR2. 


(20) 


(21) 
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Lemma  8.  Suppose  that  y  €  C*+1[0,Z]  (k  ^  1)  is  an  open  regular  curve  parametrized  by  its  arc  length  in  R2. 
Suppose  further  that  the  function  r  :  [0,  L]  x  [0,  L]  — *  R  is  defined  by  the  formula 


/  _  J  log  b to  -  7(01  ~  log  I*  ~  'l>  *  / 

n  ’  _  \  0,  x  =  t. 

Then  r€  C*([0,L]  x  [0,1]). 

Proof.  Since  y  is  parametrized  by  its  arc  length,  we  have 

I/to  I  =  1, 

for  all  x  E  [0,L].  Combining  (22),  (23),  we  observe  that 
r(x,  t)  =  log  \h(x,t)\, 

where  the  function  h  :  [0,L]  x  [0,L]  — >  R2  is  defined  by  the  formula 


,  s  f  MzM  x±t 

h(x,t)  =  {  *-[  ’  x  +  1' 

\  /to.  x  =  t. 


(22) 


(23) 

(24) 

(25) 


Obviously,  h  is  k  times  continuously  differentiable  for  y  e  C*+1[0,L]  by  Taylor’s  Theorem.  Furthermore, 
since  y(x)  f  y(t)  if  x  f  t,  and  |/(jc)|  =  1  for  all  x  e  [0,  Z.],  we  have 

\h(x,  t)\f0  for  all  (x,  t)  €  [0,L]  x  [0,1].  (26) 

Therefore,  the  function  r  =  log  \h\  is  also  k  times  continuously  differentiable  in  [0,L]  x  [0,1].  □ 


4.  Analytical  apparatus 

4.1.  Right  inverse  of  the  single  layer  potential  operator  on  the  line  segment 

The  purpose  of  this  section  is  Theorem  10,  providing  the  right  inverse  of  the  single  layer  potential 
operator  on  the  line  segment  /  =  [—1, 1].  The  construction  is  based  on  an  elementary  integral  identity  stated 
in  Lemma  9. 

Lemma  9.  For  any  x  e  (-1,1), 

f  log  \x-t\-  -jM=  d t  =  -n-  log  2  •  Toto,  (27) 

and 

[  log  |jc  r|  •  dt  =  ”•  T„(x),  (28) 

J-  i  V 1  - 12  n 

for  any  n  >  1 . 

Proof.  (27)  directly  follows  from  the  combination  of  the  identity  (18)  and  the  fact  that  7o(x)  =  1  for  all 
x  G  [— 1, 1],  To  prove  (28),  we  integrate  by  parts  once,  obtaining 
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J 1  log|x-f| 


m 


d  t: 


•-P-V.  [  -^—■Un-\{t)  ■  vT  -72dr. 
n  j-ix-t 


V\  -r2 

Now,  (28)  follows  from  the  combination  of  (29),  (19).  □ 


(29) 


Theorem  10.  Suppose  that  the  linear  operator  S  :  C[— 1, 1]  — ►  Z)[—  1, 1]  is  defined  by  its  action  on  the  functions 
Tn  (n  S*  0)  via  the  formula 

1  r°M  „  _  n 

(30) 

n  >  0. 


S(Tn)(x)  =  { 


n  •  log  2  Vl  -x2’ 

n  Tn(x) 


n  vT 


Suppose  further  that  the  operator  S/ :  £'[—1, 1]  — *•  C[— 1, 1]  is  defined  by  the  formula 
Sj(a)(x)  =  J '  log  \x  -  r|  •  o(t)dt. 


(31) 


Then 


SjOS  =  J,  (32) 

with  J  the  identity  operator.  In  other  words,  S  is  the  right  inverse  of  Sj  on  the  space  of  continuous  functions . 

Proof.  Since  Tn  (n  >  0)  form  a  basis  for  the  space  C[- 1, 1],  and  the  operators  Sl9  S  are  linear,  we  only  need 
to  prove  that  the  identity 

S,OS(Tn)(x)  =  Tn(x) 

holds  for  all  n  ^  0.  Substituting  (30)  into  (31)  we  obtain 

1  71(0 


S,OS{Tn){x)  =  { 


I 


Vl  —  r2 
dr, 


Combining  (33),  (34),  we  observe  that  it  suffices  to  prove  the  identity 

/ 


(33) 

n  =  0, 

(34) 

n  >  0. 

identity 

log  \x  -  t\  ■  -7==  dr  = 
/-i  v  1  -  r2 


-ju-log2-7o(x),  n  =  0, 

--■T„(x),  n>  0, 

n 


(35) 


which  directly  follows  from  Lemma  9.  □ 

4.2.  Second  kind  integral  equation  formulation 


In  this  section,  we  reduce  Problem  (2)  to  an  integral  equation  of  the  second  kind  on  the  curve  T ;  the 
results  are  summarized  in  Theorem  12.  We  start  with  defining  the  operator  Sy :  C[—  1, 1]  — ►  C(IR2)  via  the 
formula 

Sy(a)(z)  =  SyOS(a)(z),  (36) 

with  Sy,  S  defined  by  (5),  (30),  respectively.  Combining  (36)  with  Theorem  10,  we  easily  see  that  for  ar¬ 
bitrary  smooth  <7  :  [— 1, 1]  — >  (R  and  y(x)  E  f, 

Sy((r)(y(x))  =  S,OS(a)(x)  +  {Sy  -  S,)OS(o)(y(x))  =  o(x)  +  (Sy  -  S/)OS(a)(y(x)),  (37) 

and  the  following  theorem  shows  that  the  operator  Py  =  (Sy  -  Si)OS  is  compact. 
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Theorem  11.  Suppose  that  y  :  [-1, 1]  — >  U2  is  a  sufficiently  smooth  open  regular  curve  with  the  parametri - 
zation  (1).  Suppose  further  that  the  operator  Py  :  C[-l}  1]  — ►  C[— 1,  1]  is  defined  by  the  formula 

Py(ff)(x)  =  (Sy  -  S,)OS(<r){y(x ))  =  J '  (log  |y(x)  -  y(/)|  -  log  \x  -  r|)  •  S(ff)(r)dr,  (38) 

with  Sy,  Si,  S  defined  by  (5),  (31),  (30),  respectively.  Then  Py  is  compact. 

Proof.  By  Lemma  8,  the  function  r  :  [-1, 1]  x  [-1, 1]  — >  (R  defined  by  the  formula 

r(x,  t)  =  log  |y (x)  -  y {t)  |  -  log  \x  - 1\  (39) 

is  k  times  continuously  differentiable  for  any  ye  CM  [-1,1].  Obviously,  if?  is  expanded  into  a  double 
Chebyshev  series 

PC  00 

r{x,  r)  =  E  E  KMx)Tn{t),  (40) 

m= 0  n— 0 


then  there  exists  a  positive  number  C  such  that 


\Kmn\  < 


C 


mk  ■  nk 


(41) 


for  any  m  >  0,  n  >  0.  Now,  for  any  N  >  0,  we  will  define  the  operator  PN  :  C[-l,  1]  — >  C[-l,  1]  by  the 
formula 


Pn{o) W  =  J ^  rN(x,  t )  •  S(<r)  dr,  (42) 

with  the  function  rN  :  [—  1, 1]  x  [—1, 1]  — *  IR  defined  by  the  formula 

N  N 

rN(x,t)  =  £I>-  Tm(x)T„(t).  (43) 

m— 0  «— 0 

Obviously,  Pn  is  a  compact  operator  since  its  range  is  of  finite  dimensionality.  Furthermore,  PN  con¬ 
verges  to  Py  as  N  — ►  oo  by  (41).  Hence,  Py  is  also  a  compact  operator. 

We  will  represent  the  solution  of  Problem  (2)  via  the  formula 

u(x)  =  Sy(o)(x)  +A  =  J  log |x-  y(r)|  •  S(a)(t)dt  +  A,  (44) 

where  ,4  is  a  real  constant  to  be  determined.  Combining  Lemma  7  and  Theorem  11,  we  obtain  the  principal 
result  of  this  paper.  □ 


Theorem  12.  Suppose  that  y  :  [—  1, 1]  — >  R2  is  a  sufficiently  smooth  open  regular  curve  with  the  parametri- 
zation  (1),  and  that  the  function  f  :  [—  1, 1]  — >  !R  is  continuously  differentiable.  Suppose  further  that  the  con¬ 
tinuous  function  a  :  [—  1 , 1]  — »  R  and  the  coefficient  A  satisfy  the  equations 

a(x)  +  Py{o) (x)  =  f(x)  -  A,  (45) 


dx  =  0, 


(46) 
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with  Py  defined  in  (38).  Then  the  function  u  :  IR2  — ►  U  defined  by  (44)  is  bounded  in  IR2  and  is  the  solution  of  the 
problem 

/  Am  =  0  in  IR2  \  T,  /47\ 

\u=f  on  r. 

Remark  13.  Obviously,  the  purpose  of  the  constant  A  in  the  above  theorem  is  to  ensure  the  boundedness  of 
the  solution  u  of  (2).  In  certain  physical  situations,  the  potentials  of  interest  are  not  bounded  at  infinity,  but 
rather  grow  logarithmically.  In  such  cases,  the  solution  to  (2)  assumes  the  form 

u(x )  =  Sr(ff)(x),  (48) 

with  (7  satisfying  the  integral  equation 

<r(x)+Py(<r)(x)=f(x).  (49) 


5.  Numerical  algorithm 


In  this  section,  wc  construct  a  rudimentary  numerical  algorithm  for  the  solution  of  the  Dirichlet  problem 
(47)  via  the  Eqs.  (45)  and  (46).  Since  the  construction  of  the  matrix  and  the  solver  of  the  resulting  linear 
system  are  direct,  the  algorithm  requires  0(N3)  work  and  0(N2)  storage,  with  N  the  number  of  nodes  on 
the  boundary.  While  standard  acceleration  techniques  (such  as  the  Fast  Multipole  Method,  etc.)  could  be 
used  to  improve  these  estimates,  no  such  acceleration  was  performed,  since  the  purpose  of  this  section  (as 
well  as  the  following  one)  is  to  demonstrate  the  stability  of  the  integral  formulation  and  the  convergence 
rate  of  a  very  simple  discretization  scheme. 

By  Theorem  12,  the  equations  to  be  solved  are  (45)  and  (46),  where  the  unknowns  are  the  function  cr  and 
the  real  number  A.  To  solve  (45)  and  (46)  numerically,  we  discretize  the  boundary  into  N  Chebyshev  nodes 
and  approximate  the  unknown  density  cr  by  a  finite  Chebyshev  series  of  the  first  kind, 

ff(t)~^crr*(f),  (50) 

*=0 

with  the  coefficients  Ck  (k  =  0, . . . ,  AT  -  1)  to  be  determined.  In  order  to  discretize  (45),  we  start  with  ob¬ 
serving  that  by  (29),  the  action  of  the  operator  S  on  the  function  a  is  described  via  the  formula 

S(<j)(x)  =  -J=L=  -  Ck  •  Tk(x),  (51) 

VI  -x-  fsS 

where  the  coefficients  Bk  (k  —  0, . . . ,  N  -  1)  are  given  by  the  formulae 


,  B°  7tlog2’ 
n 


(52) 


Next,  we  approximate  the  kernel  r(x,  t)  (see  (40))  of  the  operator  Sy  -  Sj  with  an  expression  of  the  form 


r(x,t) 


N-\  N- 1 


i=o  y=o 


(53) 
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Clearly,  the  coefficients  Ku  have  to  be  determined  numerically,  since  the  curve  T  is  user-specified,  and  is 
unlikely  to  have  a  convenient  analytical  expression.  Thus,  we  obtain  the  coefficients  Ky  by  first  constructing 
the  N  x  N  matrix  R  =  (?(*,  ,  /,))  (ij  =  0, 1, ...  ,N  -  1)  with  xh  tj  the  Chebyshev  nodes  defined  by  (13)  then 
converting  R  into  the  matrix  K  =  (Ky)  ( ij  =  0, 1, . . .  ,N  —  1)  by  the  formula 

K=URUt ,  (54) 

with  N  xN  matrix  U  =  (Uy)  defined  by  the  formula 


f  U0j  =  £ •  To (xj),  j  =  1, 

\u„  =  i-Uxj),  /=!,..  .,N-1,  j  =  0,l,...,N  —  1 , 


Finally,  we  approximate  the  prescribed  Dirichlet  data  /  by  its  Chebyshev  approximation  of  order  jV  -  1 
At)  a  •  m,  (56) 

k—Q 

where  the  coefficients  fk  can  be  obtained  by  first  evaluating  f  at  Chebyshev  nodes  xh  then  applying  to  it  the 
matrix  U  defined  by  (55),  i.e., 

A  =  £  £4  •/(*)•  (57) 

i=0 

Combining  (51),  (53),  (56), we  discretize  (45)  into  the  equation 


with  N  xN  matrix  A  defined  by  the  formula 
A  =  7a(  +  K  ■  B, 

with  IN  the  N  x  TV  identity  matrix,  and  B  the  diagonal  matrix  defined  by  the  formula 

Bij  =  Bi  •  <5 ij. 

Furthermore,  (46)  leads  to  the  equation 

C0  =  0, 

Finally,  (58)  and  (61)  together  form  a  linear  system  of  dimension  N  +  \  to  be  solved. 


Remark  14.  The  generalization  of  the  above  scheme  to  the  case  of  several  disjoint  open  curves  is 
straightforward,  and  has  been  implemented  by  the  authors  (see  Example  4  in  Section  6). 


6.  Numerical  examples 

A  FORTRAN  code  has  been  written  implementing  the  algorithm  described  in  the  preceding  section.  In 
this  section,  we  demonstrate  the  performance  of  the  scheme  with  several  numerical  examples.  We  consider 


S.  Jiang,  V.  Rokhliti  I  Journal  of  Computational  Physics  195  (2004)  1-16 


11 


the  problem  in  electrostatics:  the  boundary  is  made  of  conductor  and  grounded,  the  electric  field  incident  on 
the  boundary  is  generated  by  the  sources  outside  the  boundary.  For  these  examples,  we  plot  the  equipo- 
tential  lines  of  the  total  field  and  present  tables  showing  the  convergence  rate  of  the  algorithm. 


Remark  15.  In  the  examples  below,  the  problems  to  be  solved  via  the  procedure  of  the  preceding  section 
have  no  simple  analytical  solution.  Thus,  we  tested  the  accuracy  of  our  procedure  by  evaluating  our  so¬ 
lution  via  the  formula  (44)  at  a  large  number  M  of  nodes  on  the  boundary  T  (in  our  experiments,  we  always 
used  M  =  2000),  and  comparing  it  with  the  analytically  evaluated  right-hand  side.  We  did  not  need  to  verify 
the  fact  that  our  solutions  satisfy  the  Laplace  equation,  since  this  follows  directly  from  the  representation 
(44). 

In  each  of  those  tables,  the  first  column  contains  the  total  number  N  of  nodes  in  the  discretization  of 
each  curve.  The  second  column  contains  the  condition  number  of  the  linear  system.  The  third  column 
contains  the  relative  L2  error  of  the  numerical  solution  as  compared  with  the  analytically  evaluated  Di- 
richlet  data  on  the  boundary.  The  fourth  column  contains  the  maximum  absolute  error  on  the  boundary.  In 
the  last  two  columns,  we  list  the  errors  of  the  numerical  solution  as  compared  with  the  numerical  solution 
with  twice  the  number  of  nodes,  where  the  solution  is  evaluated  at  1000  equispaced  points  on  a  circle  of 
radius  3.3  centered  at  the  origin;  the  fifth  column  contains  the  relative  L 2  error,  and  the  sixth  column 
contains  the  maximum  absolute  error. 

Example  1.  In  this  example,  the  boundary  is  the  line  segment  parametrized  by  the  formula 


1  Jit)  =  -0.2, 


(62) 


The  Dirichlet  data  are  generated  by  a  unit  charge  at  (0,0).  The  numerical  results  are  shown  in  Table  1.  The 
source,  curve  and  equipotential  lines  are  plotted  in  Fig.  1. 


Example  2.  In  this  example,  the  boundary  is  a  sinusoidal  arc  parametrized  by  the  formula 


f  x(t)  =  0.5/, 

\  y(0  =  cos(r), 


(63) 


The  Dirichlet  data  are  generated  by  one  positive  charge  of  unit  strength  at  (0,  1 .5)  and  another  negative 
charge  of  unit  strength  at  (0,  0).  The  numerical  results  are  shown  in  Table  2.  The  sources,  curve  and 
equipotential  lines  are  plotted  in  Fig.  2. 


Table  1 

Numerical  results  for  Example  1 


N 

K 

w 

E°°(r) 

E™(u) 

8 

0.200E  +  01 

0.703E-01 

0.178E+00 

0.296E-02 

0.528E-02 

16 

0.222E  +  01 

0.759E-02 

0.212E-01 

0.641E-04 

0.114E-03 

32 

0.212E  +  01 

0.165E-03 

0.486E-03 

0.556E-07 

0.991E-07 

64 

0.206E  +  01 

0.147E-06 

0.446E  -  06 

0.835E-  13 

0.150E- 12 

128 

0.203E  +  01 

0.225E-  12 

0.690E-  12 

0.355E- 15 

0.222E  -14 

256 

0.202E  +  01 

0.935E- 15 

0.214E-  13 

0.343E-  15 

0.200E-  14 
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Fig.  1.  Source,  curve,  and  equipotential  lines  for  Example  1. 


Table  2 

Numerical  results  for  Example  2 


N 

K 

£2(n 

£“(r) 

£2(«) 

£°°(u) 

32 

0.195E  +  01 

0.271E-01 

0.864E-01 

0.658E-02 

0.469E  -  02 

64 

0.187E  +  01 

0.240E  -  02 

0.847E  -  02 

0.146E-03 

0.104E-03 

128 

0.182E  +  01 

0.422E  -  04 

0.157E-03 

0.135E-06 

0.955E-07 

256 

0.179E  +  01 

0.307E-07 

0.117E-06 

0.245E-  12 

0.173E-  12 

512 

0.178E  +  01 

0.431E-  13 

0.160E- 12 

0.971E-  15 

0.133E-  14 

1024 

0.177E  +  01 

0.304E-  14 

0.450E- 13 

0.941E-  15 

0.122E-  14 

Example  3.  In  this  example,  the  boundary  is  a  spiral  parametrized  by  the  formula 


f  x(t)  =  tcos(3.3nt)  -  0.1, 
\y(f)  =  tsin(3.3nt), 


0.2<t  <3.2. 


(64) 


The  Dirichlet  data  are  generated  by  a  unit  charge  at  (0, 0).  The  numerical  results  are  shown  in  Table  3.  The 
source,  curve  and  equipotential  lines  are  plotted  in  Fig.  3. 


Example  4.  In  this  example,  we  consider  the  case  of  several  open  curves.  The  boundary  consists  of  three 
elliptic  arcs  parametrized  by  the  formulae 


{ 

{ 


X\  (/)  =  -/cos(3.3j«)  -  1.45, 
yi  (t)  =  -fsin(3.37«)  +0.55, 


0.2  <f<  1.2, 


x2(t)  =  tcos(3.3nt)  -  0.1, 
y2{f)  —  /sin(3.37it)  —  1.2, 


0.2  1.2, 


(65) 


(66) 
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Table  3 

Numerical  results  for  Example  3 


N 

K 

E2(n 

£^(r) 

E2(u) 

32 

0.704E  +  03 

0.594E  -  01 

0.125E  +  00 

0.233E  +  00 

0.685E-01 

64 

0.657E  +  02 

0.108E-02 

0.665E-02 

0.417E-02 

0.201E-02 

128 

0.523E  +  02 

0.904E-04 

0.653E-03 

0.101E-03 

0.575E  -  04 

256 

0.394E  +  02 

0.213E-05 

0.183E-04 

0.179E-06 

0.125E-06 

512 

0.279E  +  02 

0.313E-08 

0.272E  -  07 

0.156E  - 11 

0.123E-  11 

1024 

0.196E  +  02 

0.184E-  13 

0.147E-12 

0.21  IE -13 

0.933E- 14 

f  Xi(t)  =  fcos(3.3itf)  +  1.25, 
\  ;^(f)  =  f  sin(3.3jr?)  +  0.85, 


0.2  1.2. 


(67) 


The  Dirichlet  data  are  generated  by  four  unit  charges  located  at  (0,0),  (1.35, 0.75),  (-1.55,0.75),  (0,  -1.2). 
The  numerical  results  are  shown  in  Table  4,  where  N  is  the  number  of  nodes  on  each  curve.  The  sources, 
curves  and  equipotential  lines  are  plotted  in  Fig.  4. 


Remark  16.  The  above  examples  illustrate  the  superalgebraic  convergence  of  the  scheme  for  smooth  data 
and  curves  (see  Lemmas  5,  8).  The  number  of  nodes  needed  depends  on  the  complexity  of  the  underlying 
geometry  and  the  smoothness  of  the  prescribed  data.  The  condition  number  of  the  resulting  linear  system  is 
usually  very  low. 


7.  Conclusions  and  generalizations 


We  have  presented  a  second  kind  integral  equation  formulation  for  the  Dirichlet  problem  for  the  La¬ 
place  equation  in  two  dimensions,  with  the  boundary  condition  specified  on  a  curve  (consisting  of  one  or 


N 

K 

E\r) 

£«(r) 

E\u) 

£°°(u) 

8 

0.204E  +  02 

0.825E-01 

0.370E  +  00 

0.848E  +  01 

0.451E-01 

16 

0.183E  +  02 

0.180E-01 

0.121E  +  00 

0.259E  +  00 

0.121E-02 

32 

0.145E  +  02 

0.183E-02 

0.1 3  IE  —  01 

0.665E-03 

0.355E-05 

64 

0.116E  +  02 

0.355E-04 

0.455E  -  03 

0.738E-07 

0.252E  -  09 

128 

0.963E  +  01 

0.314E-07 

0.353E-06 

0.302E-11 

0.232E-  13 

256 

0.851E  +  01 

0.51  IE- 13 

0.520E-  12 

0.269E  - 1 1 

0.192E-  13 

more  separate  segments).  The  resulting  numerical  algorithm  converges  superalgebraically  whenever  both 
the  boundary  data  and  the  curves  are  smooth. 

In  order  to  concentrate  on  the  derivation  and  analysis  of  the  integral  formulation,  we  have  chosen  to  use 
a  very  simple  numerical  scheme  (see  Section  5  above);  the  CPU  time  requirements  of  the  procedure  of 
Section  5  scale  as  n3,  with  n  the  number  of  nodes  in  the  discretization  of  the  curve  where  the  boundary 
condition  is  specified.  A  straightforward  combination  of  the  Fast  Multipole  Method  (FMM),  Fast  Fourier 
Transform  (FFT),  and  one  of  many  standard  iterative  solvers  yields  an  order  n  •  log(w)  algorithm;  such  a 
scheme  has  been  implemented,  and  will  be  reported  at  a  later  date.  It  is  also  possible  to  construct  an  order  n 
scheme  via  the  use  of  the  FMM  alone;  according  to  the  authors’  estimates,  for  problems  of  practical  size, 
this  would  offer  no  advantages  over  an  FFT-based  procedure. 

Remark  17.  In  the  iterative  scheme  outlined  above,  each  step  requires  order  n  *  log(«)  operations.  Obvi¬ 
ously,  the  complexity  of  the  scheme  also  depends  on  the  number  of  iterations  needed  to  reach  a  required 
tolerance,  which  is  to  a  large  extent  (though  not  entirely)  determined  by  the  spectral  behavior  of  the  dis¬ 
cretized  system.  As  observed  by  one  of  referees  to  this  paper,  a  critical  question  for  large-scale  problems  is 
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Fig.  4.  Sources,  curves,  and  equipotential  lines  for  Example  4. 


how  the  spectrum  of  the  matrix  A  of  the  discretized  system  (or  the  spectrum  of  A* A  depending  on  the  al¬ 
gorithm  used)  behaves  as  more  and  more  reasonably  separated  curves  of  similar  shapes  are  added  to  the 
geometry.  This  is  currently  under  investigation. 

The  scheme  of  this  paper  can  be  applied  almost  without  modification  to  elliptic  PDEs  other  than  the 
Laplace  equation  (such  as  Helmholtz  equation,  Yukawa  equation,  etc.).  Indeed,  the  Green’s  function  for 
any  such  equation  has  the  form 

G(x,y)  =  <t>(x,y)  •  log(||x  -y||)  +  *l/(x,y),  (68) 

with  (j>,  \j/  a  pair  of  smooth  functions  (see,  for  example,  [2]).  When  the  procedure  of  Section  4  of  this  paper  is 
applied  to  Green’s  function  of  the  form  (68),  the  result  is  unchanged,  except  for  the  change  in  the  compact 
operator  Py  in  (38).  However,  the  convergence  rate  of  the  numerical  scheme  of  Section  5  deteriorates 
drastically,  since  in  this  case  the  kernel  K  of  the  operator  Py  in  (38)  is  logarithmically  singular  (while  for  the 
Laplace  equation,  it  is  smooth).  High-order  discretization  schemes  for  such  integral  equations  can  be  found 
in  the  literature  (see,  for  example,  [1,8,13]). 

Finally,  most  results  of  this  paper  admit  generalizations  to  two-dimensional  surfaces  in  R3;  while  the 
necessary  analytical  apparatus  is  more  involved,  the  results  are  very  similar  to  those  obtained  here.  Spe¬ 
cifically,  the  product  of  a  hypersingular  integral  operator  on  an  open  surface  in  R3  with  the  single  layer 
potential  operators  (either  from  the  left  or  from  the  right)  is  an  integral  operator  of  the  second  kind,  except 
for  simple  corrections  at  the  boundary  of  the  surface.  Such  a  scheme  in  three  dimensions  is  being  imple¬ 
mented,  and  will  be  reported  at  a  later  date. 
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